Foreword



The present book presents some improved estimators that use auxiliary and attribute
information under simple random sampling and stratified random sampling, in some cases
in the presence of non-response.

This volume is a collection of five papers, written by seven co-authors (listed in the
order of the papers): Sachin Malik, Rajesh Singh, Florentin Smarandache, B. B. Khare, P. S.
Jha, Usha Srivastava and Habib Ur Rehman.

The first and the second papers deal with the problem of estimating the finite
population mean when some information on two auxiliary attributes is available. In the third
paper, problems related to the estimation of the ratio and product of two population means using
auxiliary characters, with special reference to non-response, are discussed.

In the fourth paper, the problem of estimating the population mean using the coefficient of
variation and shape parameters in each stratum is considered. In the fifth
paper, an improved chain ratio-cum-regression type estimator for the population mean in
the presence of non-response is studied for fixed cost and specified precision.

The authors hope that the book will be helpful for researchers and students who are
working in the field of sampling techniques.
A Generalized Family of Estimators for Estimating Population
Mean Using Two Auxiliary Attributes

¹Sachin Malik, ¹†Rajesh Singh and ²Florentin Smarandache

¹Department of Statistics, Banaras Hindu University
Varanasi-221005, India

²Chair of Department of Mathematics, University of New Mexico, Gallup, USA
†Corresponding author, rsinghstat@gmail.com



Abstract 

This paper deals with the problem of estimating the finite population mean when some
information on two auxiliary attributes is available. A class of estimators is defined which
includes the estimators recently proposed by Malik and Singh (2012), Naik and Gupta (1996)
and Singh et al. (2007) as particular cases. It is shown that the proposed estimator is more
efficient than the usual mean estimator and other existing estimators. The study is also
extended to two-phase sampling. The results are illustrated numerically using
empirical populations considered in the literature.

Keywords Simple random sampling, two-phase sampling, auxiliary attribute, point bi-serial
correlation, phi correlation, efficiency.



1. Introduction 

There are some situations when, in place of one auxiliary attribute, we have
information on two qualitative variables. For illustration, to estimate the hourly wages we can
use the information on marital status and region of residence (see Gujarati and Sangeetha
(2007), page 311). Here we assume that both auxiliary attributes have significant point bi-serial
correlation with the study variable and that there is significant phi-correlation (see Yule
(1912)) between the auxiliary attributes. The use of auxiliary information can increase the
precision of an estimator when the study variable Y is highly correlated with the auxiliary variables
X. In survey sampling, auxiliary variables are usually present in the form of ratio scale variables (e.g.
income, output, prices, costs, height and temperature) but sometimes may be present in the form
of qualitative or nominal scale variables such as sex, race, color, religion, nationality and geographical
region. For example, female workers are found to earn less than their male counterparts, or
non-white workers are found to earn less than whites (see Gujarati and Sangeetha (2007), page
304). Naik and Gupta (1996) introduced a ratio estimator when the study variable and the
auxiliary attribute are positively correlated. Jhajj et al. (2006) suggested a family of
estimators for the population mean in single and two-phase sampling when the study variable
and auxiliary attribute are positively correlated. Shabbir and Gupta (2007), Singh et al.
(2008), Singh et al. (2010) and Abd-Elfattah et al. (2010) have considered the problem of
estimating the population mean $\bar{Y}$ taking into consideration the point biserial correlation
between the auxiliary attribute and the study variable.



2. Some Estimators in Literature 

In order to have an estimate of the population mean of the study variable y, assuming
knowledge of the population proportion P, Naik and Gupta (1996) and Singh et al. (2007),
respectively, proposed the following estimators:

$$ t_1 = \bar{y}\left(\frac{P_1}{p_1}\right) \tag{2.1} $$

$$ t_2 = \bar{y}\left(\frac{p_2}{P_2}\right) \tag{2.2} $$

$$ t_3 = \bar{y}\exp\left(\frac{P_1 - p_1}{P_1 + p_1}\right) \tag{2.3} $$

$$ t_4 = \bar{y}\exp\left(\frac{p_2 - P_2}{p_2 + P_2}\right) \tag{2.4} $$

The bias and MSE expressions of the estimators $t_i$ (i = 1, 2, 3, 4), up to the first order of
approximation, are respectively given by



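$$ B(t_1) = \bar{Y} f_1 C_{p_1}^2\left[1 - K_{pb_1}\right] \tag{2.5} $$

$$ B(t_2) = \bar{Y} f_1 K_{pb_2} C_{p_2}^2 \tag{2.6} $$

$$ B(t_3) = \bar{Y} f_1\left[\,\cdots\,\right] \tag{2.7} $$

$$ B(t_4) = \bar{Y} f_1\left[\,\cdots\,\right] \tag{2.8} $$

$$ MSE(t_1) = \bar{Y}^2 f_1\left[C_y^2 + C_{p_1}^2\left(1 - 2K_{pb_1}\right)\right] \tag{2.9} $$

$$ MSE(t_2) = \bar{Y}^2 f_1\left[C_y^2 + C_{p_2}^2\left(1 + 2K_{pb_2}\right)\right] \tag{2.10} $$

$$ MSE(t_3) = \bar{Y}^2 f_1\left[C_y^2 + C_{p_1}^2\left(\frac{1}{4} - K_{pb_1}\right)\right] \tag{2.11} $$

$$ MSE(t_4) = \bar{Y}^2 f_1\left[C_y^2 + C_{p_2}^2\left(\frac{1}{4} + K_{pb_2}\right)\right] \tag{2.12} $$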



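where

$$ C_y = \frac{S_y}{\bar{Y}}, \quad C_{p_j} = \frac{S_{\phi_j}}{P_j} \ (j = 1, 2), \quad K_{pb_1} = \rho_{pb_1}\frac{C_y}{C_{p_1}}, \quad K_{pb_2} = \rho_{pb_2}\frac{C_y}{C_{p_2}}. $$

Let

$$ s_{\phi_1\phi_2} = \frac{1}{n-1}\sum_{i=1}^{n}(\phi_{1i} - p_1)(\phi_{2i} - p_2) \quad\text{and}\quad \hat{\rho}_{\phi} = \frac{s_{\phi_1\phi_2}}{s_{\phi_1}s_{\phi_2}} $$

be the sample phi-covariance and phi-correlation between $\phi_1$ and $\phi_2$ respectively, corresponding to the population phi-covariance
and phi-correlation

$$ S_{\phi_1\phi_2} = \frac{1}{N-1}\sum_{i=1}^{N}(\phi_{1i} - P_1)(\phi_{2i} - P_2) \quad\text{and}\quad \rho_{\phi} = \frac{S_{\phi_1\phi_2}}{S_{\phi_1}S_{\phi_2}}. $$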



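Malik and Singh (2012) proposed estimators $t_5$ and $t_6$ as

$$ t_5 = \bar{y}\left(\frac{P_1}{p_1}\right)^{\alpha_1}\left(\frac{P_2}{p_2}\right)^{\alpha_2} \tag{2.13} $$

$$ t_6 = \bar{y}\exp\left(\frac{P_1 - p_1}{P_1 + p_1}\right)^{\beta_1}\exp\left(\frac{p_2 - P_2}{p_2 + P_2}\right)^{\beta_2} \tag{2.14} $$

where $\alpha_1$, $\alpha_2$, $\beta_1$ and $\beta_2$ are real constants.

The bias and MSE expressions of the estimators $t_5$ and $t_6$, up to the first order of
approximation, are respectively given by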



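$$ B(t_5) = \bar{Y} f_1\left[C_{p_1}^2\left(\frac{\alpha_1^2}{2} + \frac{\alpha_1}{2} - \alpha_1 K_{pb_1}\right) + C_{p_2}^2\left(\frac{\alpha_2^2}{2} + \frac{\alpha_2}{2} - \alpha_2 K_{pb_2} + \alpha_1\alpha_2 K_{\phi}\right)\right] \tag{2.15} $$

$$ B(t_6) = \bar{Y} f_1\left[\,\cdots\,\right] \tag{2.16} $$

$$ MSE(t_5) = \bar{Y}^2 f_1\left[C_y^2 + C_{p_1}^2\left(\alpha_1^2 - 2\alpha_1 K_{pb_1}\right) + C_{p_2}^2\left(\alpha_2^2 - 2\alpha_2 K_{pb_2} + 2\alpha_1\alpha_2 K_{\phi}\right)\right] \tag{2.17} $$

$$ MSE(t_6) = \bar{Y}^2 f_1\left[C_y^2 + C_{p_1}^2\left(\frac{\beta_1^2}{4} - \beta_1 K_{pb_1}\right) + C_{p_2}^2\left(\frac{\beta_2^2}{4} - \frac{\beta_1\beta_2}{2}K_{\phi} + \beta_2 K_{pb_2}\right)\right] \tag{2.18} $$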



3. The Suggested Class of Estimators 

Using a linear combination of $t_i$ (i = 0, 1, 2), we define an estimator of the form

$$ t_p = \sum_{i=0}^{2} w_i t_i \in H $$

such that $\sum_{i=0}^{2} w_i = 1$ and $w_i \in R$,

where

$$ t_0 = \bar{y}, \qquad t_1 = \bar{y}\left[\frac{L_1 P_1 + L_2}{L_1 p_1 + L_2}\right]^{\alpha_1}\left[\frac{L_3 P_2 + L_4}{L_3 p_2 + L_4}\right]^{\alpha_2} $$

and

$$ t_2 = \bar{y}\exp\left[\frac{(L_5 P_1 + L_6) - (L_5 p_1 + L_6)}{(L_5 P_1 + L_6) + (L_5 p_1 + L_6)}\right]^{\beta_1}\exp\left[\frac{(L_7 p_2 + L_8) - (L_7 P_2 + L_8)}{(L_7 p_2 + L_8) + (L_7 P_2 + L_8)}\right]^{\beta_2} $$

where $w_i$ (i = 0, 1, 2) denotes the constants used for reducing the bias in the class of
estimators, H denotes the set of those estimators that can be constructed from $t_i$ (i = 0, 1, 2)
and R denotes the set of real numbers (for details see Singh et al. (2008)). Also,
$L_i$ (i = 1, 2, ..., 8) are either real numbers or functions of the known parameters of the
auxiliary attributes.

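Expressing $t_p$ in terms of e's, we have

$$ t_p = \bar{Y}(1 + e_0)\Big[w_0 + w_1(1 + \varphi_1 e_1)^{-\alpha_1}(1 + \varphi_2 e_2)^{-\alpha_2} + w_2\exp\big(-\theta_1 e_1[1 + \theta_1 e_1]^{-1}\big)^{\beta_1}\exp\big(\theta_2 e_2[1 + \theta_2 e_2]^{-1}\big)^{\beta_2}\Big] \tag{3.3} $$

where

$$ \varphi_1 = \frac{L_1 P_1}{L_1 P_1 + L_2}, \quad \varphi_2 = \frac{L_3 P_2}{L_3 P_2 + L_4}, \quad \theta_1 = \frac{L_5 P_1}{2[L_5 P_1 + L_6]}, \quad \theta_2 = \frac{L_7 P_2}{2[L_7 P_2 + L_8]}. $$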

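After expanding, subtracting $\bar{Y}$ from both sides of equation (3.3) and neglecting the terms
having power greater than two, we have

$$ (t_p - \bar{Y}) = \bar{Y}\left[e_0 - w_1(\alpha_1\varphi_1 e_1 + \alpha_2\varphi_2 e_2) - w_2(\beta_1\theta_1 e_1 - \beta_2\theta_2 e_2)\right] \tag{3.4} $$

Squaring both sides of (3.4) and then taking expectations, we get the MSE of the estimator $t_p$ up
to the first order of approximation, as

$$ MSE(t_p) = \bar{Y}^2 f_1\left[C_y^2 + w_1^2 T_1 + w_2^2 T_2 + 2w_1 w_2 T_3 - 2w_1 T_4 - 2w_2 T_5\right] $$

which is minimised for

$$ w_1 = \frac{T_2 T_4 - T_3 T_5}{T_1 T_2 - T_3^2}, \qquad w_2 = \frac{T_1 T_5 - T_3 T_4}{T_1 T_2 - T_3^2}, $$

where

$$ T_1 = \varphi_1^2\alpha_1^2 C_{p_1}^2 + \varphi_2^2\alpha_2^2 C_{p_2}^2 + 2\alpha_1\alpha_2\varphi_1\varphi_2 K_{\phi} C_{p_2}^2 $$

$$ T_2 = \theta_1^2\beta_1^2 C_{p_1}^2 + \theta_2^2\beta_2^2 C_{p_2}^2 - 2\beta_1\beta_2\theta_1\theta_2 K_{\phi} C_{p_2}^2 $$

$$ T_3 = \alpha_1\beta_1\varphi_1\theta_1 C_{p_1}^2 - \alpha_2\beta_2\varphi_2\theta_2 C_{p_2}^2 + \alpha_2\beta_1\varphi_2\theta_1 K_{\phi} C_{p_2}^2 - \alpha_1\beta_2\varphi_1\theta_2 K_{\phi} C_{p_2}^2 $$

$$ T_4 = \alpha_1\varphi_1 K_{pb_1} C_{p_1}^2 + \alpha_2\varphi_2 K_{pb_2} C_{p_2}^2 $$

$$ T_5 = \beta_1\theta_1 K_{pb_1} C_{p_1}^2 - \beta_2\theta_2 K_{pb_2} C_{p_2}^2 $$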
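
The optimum weights follow from setting the partial derivatives of the weight-dependent part of the MSE bracket to zero, which gives a 2x2 linear system in $w_1$ and $w_2$. The following is a minimal sketch of that step, assuming the bracket has the form $w_1^2 T_1 + w_2^2 T_2 + 2w_1 w_2 T_3 - 2w_1 T_4 - 2w_2 T_5$; the function names and the numerical values of $T_1, \ldots, T_5$ are hypothetical and serve only as an illustration.

```python
def optimal_weights(T1, T2, T3, T4, T5):
    """Minimise w1^2*T1 + w2^2*T2 + 2*w1*w2*T3 - 2*w1*T4 - 2*w2*T5 over (w1, w2)."""
    det = T1 * T2 - T3 ** 2
    if abs(det) < 1e-12:
        raise ValueError("T1*T2 - T3^2 is numerically zero; the optimum is not unique")
    w1 = (T2 * T4 - T3 * T5) / det
    w2 = (T1 * T5 - T3 * T4) / det
    w0 = 1.0 - w1 - w2  # the weights are constrained to sum to one
    return w0, w1, w2


def mse_bracket(w1, w2, T1, T2, T3, T4, T5):
    """Weight-dependent part of the bracket in MSE(t_p)."""
    return w1**2 * T1 + w2**2 * T2 + 2 * w1 * w2 * T3 - 2 * w1 * T4 - 2 * w2 * T5


# Hypothetical T values, chosen only so that T1 > 0 and T1*T2 - T3^2 > 0.
T = (1.2, 0.5, 0.3, 0.9, 0.4)
w0, w1, w2 = optimal_weights(*T)
print(w0, w1, w2, mse_bracket(w1, w2, *T))
```

Because $T_1 > 0$ and $T_1 T_2 - T_3^2 > 0$ in this illustration, the stationary point is a minimum; with real data the same condition should be checked before using the weights.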
4. Empirical Study

Data: (Source: Government of Pakistan (2004))

The population consists of rice cultivation areas in 73 districts of Pakistan. The variables
are defined as:

Y = rice production (in 000' tonnes, with one tonne = 0.984 ton) during 2003,

$P_1$ = proportion of farms where rice production is more than 20 tonnes during the year 2002, and
$P_2$ = proportion of farms with rice cultivation area more than 20 ha during the year 2003.

For this data, we have

N = 73, $\bar{Y}$ = 61.3, $P_1$ = 0.4247, $P_2$ = 0.3425, $S_y^2$ = 12371.4, $S_{\phi_1}^2$ = 0.225490, $S_{\phi_2}^2$ = 0.228311,

$\rho_{pb_1}$ = 0.621, $\rho_{pb_2}$ = 0.673, $\rho_{\phi}$ = 0.889.
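
As a rough check on these figures, the percent relative efficiency (PRE) of a simple ratio estimator with respect to $\bar{y}$ can be computed directly, since the factor $f_1$ cancels in the ratio of the variance of $\bar{y}$ to the first-order MSE $\bar{Y}^2 f_1[C_y^2 + C_p^2(1 - 2K_{pb})]$. The sketch below assumes that the unlabelled value 0.225490 is $S_{\phi_1}^2$; the function and variable names are illustrative only.

```python
import math

# Population values quoted for the rice data set.
Y_BAR = 61.3
P1, P2 = 0.4247, 0.3425
S2_Y = 12371.4
S2_PHI1, S2_PHI2 = 0.225490, 0.228311  # assumption: 0.225490 is S^2 for attribute 1
RHO_PB1, RHO_PB2 = 0.621, 0.673


def pre_ratio(s2_y, y_bar, s2_phi, prop, rho_pb):
    """PRE (in percent) of the ratio estimator y_bar * P / p relative to y_bar."""
    cy2 = s2_y / y_bar**2            # squared coefficient of variation of y
    cp2 = s2_phi / prop**2           # squared coefficient of variation of the attribute
    k_pb = rho_pb * math.sqrt(cy2 / cp2)
    return 100.0 * cy2 / (cy2 + cp2 * (1.0 - 2.0 * k_pb))


pre_attr1 = pre_ratio(S2_Y, Y_BAR, S2_PHI1, P1, RHO_PB1)
pre_attr2 = pre_ratio(S2_Y, Y_BAR, S2_PHI2, P2, RHO_PB2)
print(round(pre_attr1, 2), round(pre_attr2, 2))
```

Under these assumptions the two values land close to the PREs 162.68 and 179.77 reported in Table 4.1 for the ratio-type members that use only the first or only the second attribute; small differences are expected from rounding in the published summary statistics.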

Table 4.1: PRE of different estimators of $\bar{Y}$ with respect to $\bar{y}$.

CHOICE OF SCALARS, when $w_0 = 0$, $w_1 = 1$, $w_2 = 0$

$\alpha_1$   $\alpha_2$   $L_1$         $L_2$           $L_3$         $L_4$           PRE'S
 0           1            -             -               1             0               179.77
 1           0            1             0               -             -               162.68
 1           1            1             1               1             1               156.28
-1           1            1             0               1             0               112.97
 1           1            $C_{p_1}$     $\rho_{pb_1}$   $C_{p_2}$     $\rho_{pb_2}$   178.10
 1           1            $NP_1$        $K_{pb_1}$      $NP_2$        $K_{pb_2}$      110.95
-1           1            $NP_1$        $f$             $NP_2$        $f$             112.78
-1           1            $N$           $K_{pb_1}$      $N$           $K_{pb_2}$      112.68
-1           1            $NP_1$        $P_1$           $NP_2$        $P_2$           112.32
 1           1            $n$           $P_1$           $n$           $P_2$           115.32
-1           1            $N$           $\rho_{pb_1}$   $N$           $\rho_{pb_2}$   112.38
-1           1            $n$           $P_1$           $n$           $P_2$           113.00
-1           1            $N$           $P_1$           $N$           $P_2$           112.94

When $w_0 = 0$, $w_1 = 0$, $w_2 = 1$
5. Double Sampling

It is assumed that the population proportion $P_1$ for the first auxiliary attribute $\phi_1$ is
unknown but the same is known for the second auxiliary attribute $\phi_2$. When $P_1$ is unknown, it
is sometimes estimated from a preliminary large sample of size $n'$ on which only the
attribute $\phi_1$ is measured. Then a second-phase sample of size $n$ ($n < n'$) is drawn and Y is
observed.

Let $p_j' = \frac{1}{n'}\sum_{i=1}^{n'}\phi_{ji}$, (j = 1, 2).

The estimators $t_1$, $t_2$, $t_3$ and $t_4$ in two-phase sampling take the following form
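$$ t_{d3} = \bar{y}\exp\left(\frac{p_1' - p_1}{p_1' + p_1}\right) $$

$$ t_{d4} = \bar{y}\exp\left(\frac{p_2' - P_2}{p_2' + P_2}\right) $$

The bias and MSE expressions of the estimators $t_{d1}$, $t_{d2}$, $t_{d3}$ and $t_{d4}$, up to the first order of
approximation, are respectively given as

$$ B(t_{d1}) = \bar{Y} f_3 C_{p_1}^2\left[1 - K_{pb_1}\right] $$

$$ B(t_{d3}) = \bar{Y}\left[\,\cdots\,\right] $$

$$ B(t_{d4}) = \bar{Y}\left[\,\cdots\,\right] $$

$$ MSE(t_{d1}) = \bar{Y}^2\left[f_1 C_y^2 + f_3 C_{p_1}^2\left(1 - 2K_{pb_1}\right)\right] $$

$$ MSE(t_{d2}) = \bar{Y}^2\left[f_1 C_y^2 + f_2 C_{p_2}^2\left(1 + 2K_{pb_2}\right)\right] \tag{5.10} $$

$$ MSE(t_{d3}) = \bar{Y}^2\left[f_1 C_y^2 + f_3\frac{C_{p_1}^2}{4}\left(1 - 4K_{pb_1}\right)\right] \tag{5.11} $$

$$ MSE(t_{d4}) = \bar{Y}^2\left[f_1 C_y^2 + f_2\frac{C_{p_2}^2}{4}\left(1 + 4K_{pb_2}\right)\right] \tag{5.12} $$

where

$$ s_{\phi_j}^2 = \frac{1}{n-1}\sum_{i=1}^{n}(\phi_{ji} - p_j)^2, \qquad s_{\phi_j}'^2 = \frac{1}{n'-1}\sum_{i=1}^{n'}(\phi_{ji} - p_j')^2, $$

$$ f_2 = \frac{1}{n'} - \frac{1}{N}, \qquad f_3 = \frac{1}{n} - \frac{1}{n'}. $$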
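The estimators $t_5$ and $t_6$, in two-phase sampling, take the following form

$$ t_{d5} = \bar{y}\left(\frac{p_1'}{p_1}\right)^{m_1}\left(\frac{P_2}{p_2'}\right)^{m_2} \tag{5.13} $$

$$ t_{d6} = \bar{y}\exp\left(\frac{p_1' - p_1}{p_1' + p_1}\right)^{n_1}\exp\left(\frac{p_2' - P_2}{p_2' + P_2}\right)^{n_2} \tag{5.14} $$

where $m_1$, $m_2$, $n_1$ and $n_2$ are real constants.

The bias and MSE expressions of the estimators $t_{d5}$ and $t_{d6}$, up to the first order of
approximation, are respectively given by

$$ B(t_{d5}) = \bar{Y}\left[f_3 C_{p_1}^2\left(\frac{m_1^2}{2} + \frac{m_1}{2} - m_1 K_{pb_1}\right) + f_2 C_{p_2}^2\left(\frac{m_2^2}{2} + \frac{m_2}{2} - m_2 K_{pb_2}\right)\right] \tag{5.15} $$

$$ B(t_{d6}) = \bar{Y}\left[\,\cdots\,\right] \tag{5.16} $$

$$ MSE(t_{d5}) = \bar{Y}^2\left[f_1 C_y^2 + f_3 C_{p_1}^2\left(m_1^2 - 2m_1 K_{pb_1}\right) + f_2 C_{p_2}^2\left(m_2^2 - 2m_2 K_{pb_2}\right)\right] \tag{5.17} $$

$$ MSE(t_{d6}) = \bar{Y}^2\left[f_1 C_y^2 + f_3\left(\frac{n_1^2}{4} - n_1 K_{pb_1}\right)C_{p_1}^2 + f_2\left(\frac{n_2^2}{4} + n_2 K_{pb_2}\right)C_{p_2}^2\right] \tag{5.18} $$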



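6. Estimator $t_{pd}$ in Two-Phase Sampling

Using a linear combination of $t_{di}$ (i = 0, 1, 2), we define an estimator of the form

$$ t_{pd} = \sum_{i=0}^{2} h_i t_{di} \in H $$

such that $\sum_{i=0}^{2} h_i = 1$ and $h_i \in R$,

where

$$ t_{d0} = \bar{y}, \qquad t_{d1} = \bar{y}\left[\frac{L_1 p_1' + L_2}{L_1 p_1 + L_2}\right]^{m_1}\left[\frac{L_3 P_2 + L_4}{L_3 p_2' + L_4}\right]^{m_2} $$

and

$$ t_{d2} = \bar{y}\exp\left[\frac{(L_5 p_1' + L_6) - (L_5 p_1 + L_6)}{(L_5 p_1' + L_6) + (L_5 p_1 + L_6)}\right]^{n_1}\exp\left[\frac{(L_7 p_2' + L_8) - (L_7 P_2 + L_8)}{(L_7 p_2' + L_8) + (L_7 P_2 + L_8)}\right]^{n_2} $$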

where $h_i$ (i = 0, 1, 2) denotes the constants used for reducing the bias in the class of estimators,
H denotes the set of those estimators that can be constructed from $t_{di}$ (i = 0, 1, 2) and R
denotes the set of real numbers (for details see Singh et al. (2008)). Also, $L_i$ (i = 1, 2, ..., 8) are
either real numbers or functions of the known parameters of the auxiliary attributes.

Rajesh Singh ■ Florentin Smarandache (editors)

Expressing $t_{pd}$ in terms of e's, we have

$$ t_{pd} = \bar{Y}(1 + e_0)\Big[h_0 + h_1(1 + \varphi_1 e_1')^{m_1}(1 + \varphi_1 e_1)^{-m_1}(1 + \varphi_2 e_2')^{-m_2} + h_2\exp\big(\theta_1[e_1' - e_1][1 + \theta_1(e_1' + e_1)]^{-1}\big)^{n_1}\exp\big(\theta_2 e_2'[1 + \theta_2 e_2']^{-1}\big)^{n_2}\Big] \tag{6.3} $$

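After expanding, subtracting $\bar{Y}$ from both sides of equation (6.3) and neglecting the
terms having power greater than two, we have

$$ (t_{pd} - \bar{Y}) = \bar{Y}\left[e_0 + h_1(m_1\varphi_1 e_1' - m_1\varphi_1 e_1 - m_2\varphi_2 e_2') + h_2(n_1\theta_1 e_1' - n_1\theta_1 e_1 + n_2\theta_2 e_2')\right] \tag{6.4} $$

Squaring both sides of (6.4) and then taking expectations, we get the MSE of the estimator $t_{pd}$ up
to the first order of approximation, as

$$ MSE(t_{pd}) = \bar{Y}^2\left[f_1 C_y^2 + h_1^2 R_1 + h_2^2 R_2 + 2h_1 h_2 R_3 + 2h_1 R_4 + 2h_2 R_5\right] \tag{6.5} $$

which is minimised for

$$ h_1 = \frac{R_3 R_5 - R_2 R_4}{R_1 R_2 - R_3^2}, \qquad h_2 = \frac{R_3 R_4 - R_1 R_5}{R_1 R_2 - R_3^2}, \tag{6.6} $$

where

$$ R_1 = \varphi_1^2 m_1^2 f_3 C_{p_1}^2 + \varphi_2^2 m_2^2 f_2 C_{p_2}^2 $$

$$ R_2 = \theta_1^2 n_1^2 f_3 C_{p_1}^2 + \theta_2^2 n_2^2 f_2 C_{p_2}^2 $$

$$ R_3 = m_1 n_1\varphi_1\theta_1 f_3 C_{p_1}^2 - m_2 n_2\varphi_2\theta_2 f_2 C_{p_2}^2 $$

$$ R_4 = -m_1\varphi_1 f_3 K_{pb_1} C_{p_1}^2 - m_2\varphi_2 f_2 K_{pb_2} C_{p_2}^2 $$

$$ R_5 = -n_1\theta_1 f_3 K_{pb_1} C_{p_1}^2 + n_2\theta_2 f_2 K_{pb_2} C_{p_2}^2 $$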

Data: (Source: Singh and Chaudhary (1986), p. 177).

The population consists of 34 wheat farms in 34 villages in a certain region of India. The
variables are defined as:

y = area under wheat crop (in acres) during 1974,

$p_1$ = proportion of farms under wheat crop which have more than 500 acres of land during 1971,
and
$p_2$ = proportion of farms under wheat crop which have more than 100 acres of land during 1973.

For this data, we have

N = 34, $\bar{Y}$ = 199.4, $P_1$ = 0.6765, $P_2$ = 0.7353, $S_y^2$ = 22564.6, $S_{\phi_1}^2$ = 0.225490, $S_{\phi_2}^2$ = 0.200535,

$\rho_{pb_1}$ = 0.599, $\rho_{pb_2}$ = 0.559, $\rho_{\phi}$ = 0.725.



Table 6.1: PRE of different estimators of $\bar{Y}$ with respect to $\bar{y}$

CHOICE OF SCALARS, when $h_0 = 0$, $h_1 = 1$, $h_2 = 0$

$m_1$   $m_2$   $L_1$         $L_2$           $L_3$         $L_4$           PRE'S
 0      1       -             -               1             0               108.16
 1      0       1             0               -             -               121.59
 1      1       1             1               1             1               142.19
 1      1       1             0               1             0               133.40
 1      1       $C_{p_1}$     $\rho_{pb_1}$   $C_{p_2}$     $\rho_{pb_2}$   144.78
 1      1       $NP_1$        $K_{pb_1}$      $NP_2$        $K_{pb_2}$      136.90
 1      1       $NP_1$        $f$             $NP_2$        $f$             133.30
 1      1       $N$           $K_{pb_1}$      $N$           $K_{pb_2}$      135.73
 1      1       $NP_1$        $P_1$           $NP_2$        $P_2$           137.09
 1      1       $n$           $P_1$           $n$           $P_2$           138.23
 1      1       $N$           $\rho_{pb_1}$   $N$           $\rho_{pb_2}$   135.49
 1      1       $n$           $P_1$           $n$           $P_2$           138.97
 1      1       $N$           $P_1$           $N$           $P_2$           135.86

When $h_0 = 0$, $h_1 = 0$, $h_2 = 1$

$n_1$   $n_2$   $L_5$         $L_6$           $L_7$         $L_8$           PRE'S
 1      0       1             0               1             0               130.89
 0     -1       1             0               1             0               108.93
 1     -1       1             0               1             0               146.63
 1     -1       1             1               1             1               121.68
 1     -1       1             1               1             0               127.24
 1     -1       $C_{p_1}$     $\rho_{pb_1}$   $C_{p_2}$     $\rho_{pb_2}$   123.43
 1     -1       $NP_1$        $K_{pb_1}$      $NP_2$        $K_{pb_2}$      145.49
 1     -1       $NP_1$        $f$             $NP_2$        $f$             146.57
 1     -1       $N$           $K_{pb_1}$      $N$           $K_{pb_2}$      145.84
 1     -1       $NP_1$        $P_1$           $NP_2$        $P_2$           145.43
 1     -1       $n$           $P_1$           $n$           $P_2$           145.03
 1     -1       $N$           $\rho_{pb_1}$   $N$           $\rho_{pb_2}$   145.92
 1     -1       $n$           $P_1$           $n$           $P_2$           144.85
 1     -1       $N$           $P_1$           $N$           $P_2$           145.80

When $h_0 = 0$, $h_1 = 0$, $h_2 = 1$, and also $L_i$ (i = 1, 2, ..., 8) = 1 and
$m_1 = m_2 = n_1 = n_2 = 1$: PRE($t_{pd}$) = 154.28

7. Conclusion

In this paper, we have suggested a class of estimators in single and two-phase
sampling by using the point bi-serial correlation and phi correlation coefficients. From Table 4.1
and Table 6.1, we observe that the proposed estimators $t_p$ and $t_{pd}$ perform better than the other
estimators considered in this paper.
Abstract

This paper deals with the problem of estimating the finite population mean when some
information on an auxiliary attribute is available. It is shown that the proposed estimator is more
efficient than the usual mean estimator and other existing estimators. The results are
illustrated numerically using an empirical population considered in the literature.

Keywords Simple random sampling, auxiliary attribute, point bi-serial correlation, ratio
estimator, efficiency.



1. Introduction 

The use of auxiliary information can increase the precision of an estimator when the
study variable y is highly correlated with the auxiliary variable x. There are many situations
when auxiliary information is available in the form of attributes, e.g. sex and height of the
persons, amount of milk produced and a particular breed of cow, amount of yield of wheat
crop and a particular variety of wheat (see Jhajj et al. (2006)).

Consider a sample of size n drawn by simple random sampling without replacement
(SRSWOR) from a population of size N. Let $y_i$ and $\phi_i$ denote the observations on variables y
and $\phi$ respectively for the i-th unit (i = 1, 2, ..., N).

Let $\phi_i = 1$ if the i-th unit of the population possesses attribute $\phi$, and $\phi_i = 0$ otherwise.

Let $A = \sum_{i=1}^{N}\phi_i$ and $a = \sum_{i=1}^{n}\phi_i$ denote the total number of units in the population and sample
respectively possessing attribute $\phi$. Let $P = A/N$ and $p = a/n$ denote the proportion of units in
the population and sample respectively possessing attribute $\phi$. Naik and Gupta (1996)
introduced a ratio estimator $t_{NG}$ when the study variable and the auxiliary attribute are
positively correlated. The estimator $t_{NG}$ is given by
$$ t_{NG} = \bar{y}\,\frac{P}{p} \tag{1.1} $$

with MSE

$$ MSE(t_{NG}) = f_1\left(S_y^2 + R^2 S_{\phi}^2 - 2R S_{y\phi}\right) \tag{1.2} $$

where

$$ f_1 = \frac{N - n}{Nn}, \quad R = \frac{\bar{Y}}{P}, \quad S_y^2 = \frac{1}{N-1}\sum_{i=1}^{N}(y_i - \bar{Y})^2, $$

$$ S_{\phi}^2 = \frac{1}{N-1}\sum_{i=1}^{N}(\phi_i - P)^2, \quad S_{y\phi} = \frac{1}{N-1}\sum_{i=1}^{N}(\phi_i - P)(y_i - \bar{Y}) $$

(for details see Singh et al. (2008))
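
To make the ratio estimator and its first-order MSE concrete, the following sketch evaluates both on a toy population. It assumes the form $t_{NG} = \bar{y}P/p$ and the MSE expression $f_1(S_y^2 + R^2 S_\phi^2 - 2RS_{y\phi})$ with $f_1 = (N-n)/(Nn)$; the function names and the four-unit population are hypothetical and are used only for illustration.

```python
from statistics import mean


def naik_gupta_estimate(y_sample, phi_sample, P):
    """Ratio estimator y_bar * P / p, where p is the sample proportion."""
    p = mean(phi_sample)
    if p == 0:
        raise ValueError("no sampled unit possesses the attribute; estimator undefined")
    return mean(y_sample) * P / p


def naik_gupta_mse(y_pop, phi_pop, n):
    """First-order MSE f1 * (S_y^2 + R^2 * S_phi^2 - 2 * R * S_yphi)."""
    N = len(y_pop)
    Y_bar, P = mean(y_pop), mean(phi_pop)
    f1 = (N - n) / (N * n)
    R = Y_bar / P
    s2_y = sum((y - Y_bar) ** 2 for y in y_pop) / (N - 1)
    s2_phi = sum((f - P) ** 2 for f in phi_pop) / (N - 1)
    s_yphi = sum((f - P) * (y - Y_bar) for y, f in zip(y_pop, phi_pop)) / (N - 1)
    return f1 * (s2_y + R**2 * s2_phi - 2 * R * s_yphi)


# Toy population (hypothetical): units with the attribute tend to have larger y.
y_pop = [2, 4, 6, 8]
phi_pop = [0, 0, 1, 1]
estimate = naik_gupta_estimate([4, 6], [0, 1], P=mean(phi_pop))
mse = naik_gupta_mse(y_pop, phi_pop, n=2)
print(estimate, mse)
```

When the sample proportion happens to equal P, as in this sample, the estimator reduces to the sample mean.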

Jhajj et al. (2006) suggested a family of estimators for the population mean in single and
two-phase sampling when the study variable and auxiliary attribute are positively correlated.
Shabbir and Gupta (2007), Singh et al. (2008) and Abd-Elfattah et al. (2010) have
considered the problem of estimating the population mean $\bar{Y}$ taking into consideration the point
biserial correlation coefficient between the auxiliary attribute and the study variable.

The objective of this article is to suggest a generalised class of estimators for the population
mean $\bar{Y}$ and analyse its properties. A numerical illustration is given in support of the
present study.



2. Proposed Estimator 

Let $\phi_i^{*} = \phi_i + mA$, m being a suitably chosen scalar that takes values 0 and 1. Then

$$ q = p + mA = p + NmP, \quad\text{and}\quad Q = (Nm + 1)P, $$

where $q = \frac{b}{n}$, $Q = \frac{B}{N}$, $B = \sum_{i=1}^{N}\phi_i^{*}$ and $b = \sum_{i=1}^{n}\phi_i^{*}$.

Motivated by Bedi (1996), we define a family of estimators for the population mean $\bar{Y}$ as

$$ t = \left[w_1\bar{y} + w_2 b(P - p)\right]\left(\frac{q}{Q}\right)^{\alpha} \tag{2.1} $$

where $w_1$, $w_2$ and $\alpha$ are suitably chosen scalars.

To obtain the bias and MSE of the estimator t, we write
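$$ \bar{y} = \bar{Y}(1 + e_0), \quad p = P(1 + e_1), \quad s_{\phi}^2 = S_{\phi}^2(1 + e_2), $$

$$ s_{\phi y} = S_{\phi y}(1 + e_3), \quad b = \beta(1 + e_3)(1 + e_2)^{-1}, \quad \beta = \frac{S_{\phi y}}{S_{\phi}^2}, $$

such that $E(e_i) = 0$, i = 0, 1, 2, 3 and

$$ E(e_0^2) = \left(\frac{1}{n} - \frac{1}{N}\right)C_y^2, \qquad E(e_1^2) = \left(\frac{1}{n} - \frac{1}{N}\right)C_p^2, $$

$$ E(e_0 e_1) = \left(\frac{1}{n} - \frac{1}{N}\right)\rho_{pb}C_y C_p, \qquad E(e_1 e_2) = \left(\frac{1}{n} - \frac{1}{N}\right)C_p\lambda_{03}, $$

$$ E(e_1 e_3) = \left(\frac{1}{n} - \frac{1}{N}\right)C_p\frac{\lambda_{12}}{\rho_{pb}}. $$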



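Expressing (2.1) in terms of e's, we have

$$ t = \bar{Y}\left[w_1(1 + e_0) - w_2\frac{\beta}{R}e_1(1 + e_3)(1 + e_2)^{-1}\right]\left[1 + \frac{e_1}{Nm + 1}\right]^{\alpha} \tag{2.2} $$

We assume that $|e_2| < 1$ and $\left|\frac{e_1}{Nm + 1}\right| < 1$, so that $(1 + e_2)^{-1}$ and $\left(1 + \frac{e_1}{Nm + 1}\right)^{\alpha}$ are expandable.

Expanding the right hand side of (2.2) and retaining terms up to second powers of e's, we
have

$$ t - \bar{Y} = \bar{Y}\left[w_1\left\{1 + e_0 + \frac{\alpha e_1}{Nm + 1} + \frac{\alpha(\alpha - 1)}{2}\frac{e_1^2}{(Nm + 1)^2} + \frac{\alpha e_0 e_1}{Nm + 1}\right\} - w_2\frac{\beta}{R}\left\{e_1 + e_1 e_3 - e_1 e_2 + \frac{\alpha e_1^2}{Nm + 1}\right\} - 1\right] \tag{2.3} $$

Taking expectation of both sides of (2.3), we get the bias of t to the first degree of
approximation as:

$$ B(t) = \bar{Y}\left[(w_1 - 1) + w_1 f_1\left\{\frac{\alpha\rho_{pb}C_y C_p}{Nm + 1} + \frac{\alpha(\alpha - 1)}{2(Nm + 1)^2}C_p^2\right\} - w_2\frac{\beta}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03} + \frac{\alpha}{Nm + 1}C_p^2\right\}\right] \tag{2.4} $$

Squaring both sides of (2.3) and neglecting terms of e's having power greater than two, we
have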
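$$ (t - \bar{Y})^2 = \bar{Y}^2\Big[1 + w_1^2\Big\{1 + 2e_0 + \frac{2\alpha e_1}{Nm + 1} + e_0^2 + \frac{\alpha(2\alpha - 1)e_1^2}{(Nm + 1)^2} + \frac{4\alpha e_0 e_1}{Nm + 1}\Big\} + w_2^2\frac{\beta^2}{R^2}e_1^2 $$
$$ \quad - 2w_1 w_2\frac{\beta}{R}\Big\{e_1 + e_1 e_3 + e_0 e_1 - e_1 e_2 + \frac{2\alpha e_1^2}{Nm + 1}\Big\} - 2w_1\Big\{1 + e_0 + \frac{\alpha e_1}{Nm + 1} + \frac{\alpha e_0 e_1}{Nm + 1} + \frac{\alpha(\alpha - 1)e_1^2}{2(Nm + 1)^2}\Big\} $$
$$ \quad + 2w_2\frac{\beta}{R}\Big\{e_1 + e_1 e_3 - e_1 e_2 + \frac{\alpha e_1^2}{Nm + 1}\Big\}\Big] \tag{2.5} $$

Taking expectation of both sides of (2.5), we get the MSE of t to the first degree of
approximation as:

$$ MSE(t) = \bar{Y}^2\left[1 + w_1^2 A_{1(\alpha)}^{(m)} + w_2^2 A_2 - 2w_1 w_2 A_{3(\alpha)}^{(m)} - 2w_1 A_{4(\alpha)}^{(m)} + 2w_2 A_{5(\alpha)}^{(m)}\right] \tag{2.6} $$

where

$$ A_{1(\alpha)}^{(m)} = 1 + f_1\left\{C_y^2 + \frac{\alpha(2\alpha - 1)}{(Nm + 1)^2}C_p^2 + \frac{4\alpha}{Nm + 1}\rho_{pb}C_y C_p\right\} $$

$$ A_2 = \frac{\beta^2}{R^2}f_1 C_p^2 $$

$$ A_{3(\alpha)}^{(m)} = \frac{\beta}{R}f_1\left\{C_p^2\left(\frac{2\alpha}{Nm + 1} + k\right) + C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\} $$

$$ A_{4(\alpha)}^{(m)} = 1 + f_1\left\{\frac{\alpha\rho_{pb}C_y C_p}{Nm + 1} + \frac{\alpha(\alpha - 1)}{2(Nm + 1)^2}C_p^2\right\} $$

$$ A_{5(\alpha)}^{(m)} = \frac{\beta}{R}f_1\left\{\frac{\alpha}{Nm + 1}C_p^2 + C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\} $$

where $k = \rho_{pb}\frac{C_y}{C_p}$.

The MSE(t) is minimised for

$$ w_1 = \frac{A_2 A_{4(\alpha)}^{(m)} - A_{3(\alpha)}^{(m)}A_{5(\alpha)}^{(m)}}{A_{1(\alpha)}^{(m)}A_2 - \left(A_{3(\alpha)}^{(m)}\right)^2}, \qquad w_2 = \frac{A_{3(\alpha)}^{(m)}A_{4(\alpha)}^{(m)} - A_{1(\alpha)}^{(m)}A_{5(\alpha)}^{(m)}}{A_{1(\alpha)}^{(m)}A_2 - \left(A_{3(\alpha)}^{(m)}\right)^2}. \tag{2.7} $$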
3. Members of the Family of Estimators t and Their Biases and MSEs

Table 3.1: Different members of the family of estimators t

Choice of scalars                          Estimator
$w_1$   $w_2$   $\alpha$   m
1       0       0          0       $t_1 = \bar{y}$
$w_1$   0       0          0       $t_2 = w_1\bar{y}$ (Searls (1964) type estimator)
$w_1$   0       $\alpha$   m       $t_3 = w_1\bar{y}\left(\frac{q}{Q}\right)^{\alpha}$
$w_1$   0       $\alpha$   0       $t_4 = w_1\bar{y}\left(\frac{p}{P}\right)^{\alpha}$
1       0       -1         0       $t_5 = \bar{y}\left(\frac{P}{p}\right)$ (Naik and Gupta (1996) estimator)
1       1       -1         0       $t_6 = [\bar{y} + b(P - p)]\frac{P}{p}$ (Singh et al. (2008) estimator)
$w_1$   $w_2$   0          0       $t_7 = w_1\bar{y} + w_2 b(P - p)$
$w_1$   1       0          0       $t_8 = w_1\bar{y} + b(P - p)$
w       w       0          0       $t_9 = w[\bar{y} + b(P - p)]$
1       1       0          0       $t_{10} = \bar{y} + b(P - p)$ (Regression estimator)

The estimator $t_1 = \bar{y}$ is an unbiased estimator of the population mean $\bar{Y}$ and has the variance

$$ Var(t_1) = f_1 S_y^2 $$
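To the first degree of approximation, the biases and MSEs of the $t_i$'s, i = 1, 2, ..., 10, are
respectively given by

$$ B(t_2) = \bar{Y}(w_1 - 1) $$

$$ B(t_3) = \bar{Y}\left[(w_1 - 1) + f_1 w_1\left\{\frac{\alpha}{Nm + 1}\rho_{pb}C_y C_p + \frac{\alpha(\alpha - 1)}{2(Nm + 1)^2}C_p^2\right\}\right] $$

$$ B(t_4) = \bar{Y}\left[(w_1 - 1) + w_1 f_1\left\{\alpha\rho_{pb}C_y C_p + \frac{\alpha(\alpha - 1)}{2}C_p^2\right\}\right] $$

$$ B(t_5) = \bar{Y}f_1\left[C_p^2 - \rho_{pb}C_y C_p\right] $$

$$ B(t_6) = \bar{Y}f_1\left[\left(C_p^2 - \rho_{pb}C_y C_p\right) - \frac{\beta}{R}\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03} - C_p^2\right\}\right] $$

$$ B(t_7) = \bar{Y}\left[(w_1 - 1) - w_2\frac{\beta}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\}\right] $$

$$ B(t_8) = \bar{Y}\left[(w_1 - 1) - \frac{\beta}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\}\right] $$

$$ B(t_9) = \bar{Y}\left[(w - 1) - w\frac{\beta}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\}\right] $$

$$ B(t_{10}) = -\bar{Y}\frac{\beta}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\} $$

The corresponding MSEs will be

$$ MSE(t_2) = \bar{Y}^2\left[1 + w_1^2 A_{1(0)}^{(0)} - 2w_1 A_{4(0)}^{(0)}\right] \tag{3.11} $$

$$ MSE(t_3) = \bar{Y}^2\left[1 + w_1^2 A_{1(\alpha)}^{(m)} - 2w_1 A_{4(\alpha)}^{(m)}\right] \tag{3.12} $$

$$ MSE(t_4) = \bar{Y}^2\left[1 + w_1^2 A_{1(\alpha)}^{(0)} - 2w_1 A_{4(\alpha)}^{(0)}\right] \tag{3.13} $$

$$ MSE(t_5) = \bar{Y}^2\left[1 + A_{1(-1)}^{(0)} - 2A_{4(-1)}^{(0)}\right] \tag{3.14} $$

$$ MSE(t_6) = \bar{Y}^2\left[1 + A_{1(-1)}^{(0)} + A_2 - 2A_{3(-1)}^{(0)} - 2A_{4(-1)}^{(0)} + 2A_{5(-1)}^{(0)}\right] \tag{3.15} $$

$$ MSE(t_7) = \bar{Y}^2\left[1 + w_1^2 A_{1(0)}^{(0)} + w_2^2 A_2 - 2w_1 w_2 A_{3(0)}^{(0)} - 2w_1 A_{4(0)}^{(0)} + 2w_2 A_{5(0)}^{(0)}\right] \tag{3.16} $$

$$ MSE(t_8) = \bar{Y}^2\left[1 + w_1^2 A_{1(0)}^{(0)} + A_2 - 2w_1\left(A_{3(0)}^{(0)} + A_{4(0)}^{(0)}\right) + 2A_{5(0)}^{(0)}\right] \tag{3.17} $$

$$ MSE(t_9) = \bar{Y}^2\left[1 + w^2\left(A_{1(0)}^{(0)} + A_2 - 2A_{3(0)}^{(0)}\right) - 2w\left(A_{4(0)}^{(0)} - A_{5(0)}^{(0)}\right)\right] \tag{3.18} $$

$$ MSE(t_{10}) = \bar{Y}^2\left[1 + A_{1(0)}^{(0)} + A_2 - 2\left(A_{3(0)}^{(0)} + A_{4(0)}^{(0)} - A_{5(0)}^{(0)}\right)\right] \tag{3.19} $$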



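The MSEs of the estimators $t_i$, i = 2, 3, 4, 7, 8, 9, will be minimised, respectively, for

$$ w_1 = \frac{A_{4(0)}^{(0)}}{A_{1(0)}^{(0)}} \tag{3.20} $$

$$ w_1 = \frac{A_{4(\alpha)}^{(m)}}{A_{1(\alpha)}^{(m)}} \tag{3.21} $$

$$ w_1 = \frac{A_{4(\alpha)}^{(0)}}{A_{1(\alpha)}^{(0)}} \tag{3.22} $$

$$ w_1 = \frac{A_2 A_{4(0)}^{(0)} - A_{3(0)}^{(0)}A_{5(0)}^{(0)}}{A_2 A_{1(0)}^{(0)} - \left(A_{3(0)}^{(0)}\right)^2}, \qquad w_2 = \frac{A_{3(0)}^{(0)}A_{4(0)}^{(0)} - A_{1(0)}^{(0)}A_{5(0)}^{(0)}}{A_2 A_{1(0)}^{(0)} - \left(A_{3(0)}^{(0)}\right)^2} \tag{3.23} $$

$$ w_1 = \frac{A_{3(0)}^{(0)} + A_{4(0)}^{(0)}}{A_{1(0)}^{(0)}} \tag{3.24} $$

$$ w = \frac{A_{4(0)}^{(0)} - A_{5(0)}^{(0)}}{A_{1(0)}^{(0)} + A_2 - 2A_{3(0)}^{(0)}} \tag{3.25} $$

Thus the resulting minimum MSEs of $t_i$, i = 2, 3, 4, 7, 8, 9, are respectively given by

$$ \min MSE(t_2) = \bar{Y}^2\left[1 - \frac{\left(A_{4(0)}^{(0)}\right)^2}{A_{1(0)}^{(0)}}\right] \tag{3.26} $$

$$ \min MSE(t_3) = \bar{Y}^2\left[1 - \frac{\left(A_{4(\alpha)}^{(m)}\right)^2}{A_{1(\alpha)}^{(m)}}\right] \tag{3.27} $$

$$ \min MSE(t_4) = \bar{Y}^2\left[1 - \frac{\left(A_{4(\alpha)}^{(0)}\right)^2}{A_{1(\alpha)}^{(0)}}\right] \tag{3.28} $$

$$ \min MSE(t_7) = \bar{Y}^2\left[1 - \frac{A_2\left(A_{4(0)}^{(0)}\right)^2 - 2A_{3(0)}^{(0)}A_{4(0)}^{(0)}A_{5(0)}^{(0)} + A_{1(0)}^{(0)}\left(A_{5(0)}^{(0)}\right)^2}{A_2 A_{1(0)}^{(0)} - \left(A_{3(0)}^{(0)}\right)^2}\right] \tag{3.29} $$

$$ \min MSE(t_8) = \bar{Y}^2\left[1 + A_2 + 2A_{5(0)}^{(0)} - \frac{\left(A_{3(0)}^{(0)} + A_{4(0)}^{(0)}\right)^2}{A_{1(0)}^{(0)}}\right] \tag{3.30} $$

$$ \min MSE(t_9) = \bar{Y}^2\left[1 - \frac{\left(A_{4(0)}^{(0)} - A_{5(0)}^{(0)}\right)^2}{A_{1(0)}^{(0)} + A_2 - 2A_{3(0)}^{(0)}}\right] \tag{3.31} $$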



4. Empirical study

The data for the empirical study are taken from the natural population data set considered
by Sukhatme and Sukhatme (1970):

y = number of villages in the circles and

$\phi$ = a circle consisting of more than five villages.

N = 89, $\bar{Y}$ = 3.36, P = 0.1236, $\rho_{pb}$ = 0.766, $C_y$ = 0.6040, $C_p$ = 2.190,

$\lambda_{04}$ = 6.1619, $\lambda_{40}$ = 3.810, $\lambda_{12}$ = 146.475, $\lambda_{03}$ = 2.2744

In Table 4.1 the percent relative efficiencies (PREs) of various estimators are computed with
respect to $\bar{y}$.
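
Two of the PREs can be checked by hand, because for estimators without tuning constants the factor $f_1$ cancels at the first order of approximation. The sketch below assumes the first-order MSEs $\bar{Y}^2 f_1(C_y^2 + C_p^2 - 2\rho_{pb}C_y C_p)$ for the Naik and Gupta ratio estimator and $\bar{Y}^2 f_1 C_y^2(1 - \rho_{pb}^2)$ for the regression estimator; the variable names are illustrative only.

```python
# Summary statistics quoted for the Sukhatme and Sukhatme (1970) data set.
RHO_PB = 0.766
C_Y = 0.6040
C_P = 2.190

# PRE of the ratio estimator y_bar * P / p with respect to y_bar (f1 cancels).
pre_naik_gupta = 100.0 * C_Y**2 / (C_Y**2 + C_P**2 - 2.0 * RHO_PB * C_Y * C_P)

# PRE of the regression estimator with respect to y_bar (first order).
pre_regression = 100.0 / (1.0 - RHO_PB**2)

print(round(pre_naik_gupta, 2), round(pre_regression, 2))
```

Under these assumptions the results agree, up to rounding, with the entries 11.64 and 241.98 listed for $t_5$ and $t_{10}$. The ratio estimator performs poorly here because $C_p$ is much larger than $C_y$.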
Table 4.1: PRE of different estimators of $\bar{Y}$ with respect to $\bar{y}$.

Estimator             PRE's
$t_1 = \bar{y}$       100.00
$t_2$                 101.41
$t_3$                  90.35
$t_4$                   6.92
$t_5$                  11.64
$t_6$                   7.38
$t_7$                 100.44
$t_8$                 243.39
$t_9$                 243.42
$t_{10}$              241.98



Conclusion

The MSE values of the members of the family of the estimator t have been obtained
using (2.6), and the resulting PREs are given in Table 4.1. When we examine Table 4.1, we observe the
superiority of the proposed estimators $t_2$, $t_7$, $t_8$, $t_9$ and $t_{10}$ over the usual unbiased estimator $t_1$, $t_3$,
$t_4$, the Naik and Gupta (1996) estimator $t_5$ and the Singh et al. (2008) estimator $t_6$. From this result
we can infer that the proposed estimators $t_8$ and $t_9$ are more efficient than the rest of the
estimators considered in this paper for this data set.

We would also like to remark that the PRE corresponding to min. MSE($t_{10}$), which is equal to that
of the regression estimator, is 241.98. From Table 4.1 we notice that the
PREs of the estimators $t_8$ and $t_9$ exceed this value, that is, their MSEs are smaller.
Finally, we can say that the proposed estimators $t_8$ and $t_9$ are more efficient than the
regression estimator for this data set.
Abstract

Auxiliary information is used to increase the efficiency of estimators of
population parameters such as the mean, ratio, and product of two population
means. In this context, the estimation procedure for the ratio and product of two
population means using auxiliary characters, with special reference to the non-response
problem, is discussed.

Keywords Auxiliary variable, MSE, non-response, SRS, efficiency.



Introduction 

The use of auxiliary information in sample surveys for the estimation of the population
mean, ratio, and product of two population means has been studied by different authors
using different estimation procedures. Reviews of work on this topic have been given by
Tripathi et al. (1994) and Khare (2003). In the present context, the problems of estimating the
ratio and product of two population means are considered in different situations,
especially in the presence of non-response.

Estimation of Ratio and product of two population means 

Case 1. The Case of Complete Response: 

Singh (1965, 69), Rao and Pareira (1968), Shahoo and Shahoo (1978), Tripathi (1980),
Ray and Singh (1985) and Khare (1987) have proposed estimators of the ratio and product of two
population means using auxiliary characters with known means. Singh (1982) has considered
the case of double sampling for the estimation of the ratio and product of two population means.
Khare (1991(a)) has proposed a class of estimators for R and P using a double sampling
scheme, which are given as follows:

$$ R^{*} = f(v, u) \quad\text{and}\quad P^{*} = g(w, u) \tag{1} $$

such that $f(R, 1) = R$, $g(P, 1) = P$, $f_1(R, 1) = 1$ and $g_1(P, 1) = 1$, where $v = \frac{\bar{y}_1}{\bar{y}_2}$, $w = \bar{y}_1\bar{y}_2$
and $u = \frac{\bar{x}_1}{\bar{x}_1'}$. Here $\bar{y}_1$, $\bar{y}_2$ and $\bar{x}_1$ denote the sample means of the study characters $y_1$, $y_2$ and the
auxiliary character $x_1$ based on a subsample of size $n$ ($< n'$), and $\bar{x}_1'$ is the sample mean of $x_1$
based on a larger sample of size $n'$ drawn by the SRSWOR method of sampling from the
population of size N. The first partial derivatives of $f(v, u)$ and $g(w, u)$ with respect to
v and w are denoted by $f_1(v, u)$ and $g_1(w, u)$ respectively. The functions $f(v, u)$ and $g(w, u)$
also satisfy some regularity conditions for continuity and existence of the functions. The
second-phase sample may be drawn either from the first-phase
sample or independently of it from the remaining part of the population
$(N - n')$.

Singh et al. (1994) have extended the class of estimators proposed by Khare (1991(a)) and
proposed a new class of estimators for R, which is given as follows:

$$ \hat{R}_h = \hat{R}\,h(u', v') $$

where $\hat{R} = \frac{\bar{y}_1}{\bar{y}_2}$, $u' = \frac{\bar{x}}{\bar{x}'}$ and $v' = \frac{s_x^2}{s_x'^2}$, where $(\bar{x}, s_x^2)$ and $(\bar{x}', s_x'^2)$ are the sample mean and sample
mean square of the auxiliary character based on $n$ and $n'$ ($> n$) units respectively.

Srivastava et al. (1988, 89) have suggested chain ratio estimators for R and P, which
are given as follows:



R* x - R ^ U4 

UJU 



and Rl = — If —ri 



y 3 Ay 4 



f- (- 

P* = P Al A± and P* = P Al ^4 

UJ UJ ■ UJ UJ 



Further, Singh et al. (1994) have given a general class of estimators

$$ \hat{R}_h = h(\hat{R}, u, v) \quad\text{and}\quad \hat{P}_h = h(\hat{P}, u, v), $$

such that $h(R, 1, 1) = R$ and $h(P, 1, 1) = P$, where $u = \frac{\bar{y}_3}{\bar{Y}_3}$ and $v = \frac{\bar{y}_4}{\bar{Y}_4}$. The functions
$h(\hat{R}, u, v)$ and $h(\hat{P}, u, v)$ satisfy the regularity conditions.

Khare (1991(b)) has proposed classes of estimators for R using multi-auxiliary
characters with known means, which are given as follows:

$$ R^{*} = \hat{R}\,h(u_1, u_2, \ldots, u_p) = \hat{R}\,h(u) \quad\text{and}\quad R^{**} = g(\hat{R}, u) $$

such that $h(e) = 1$ and $g(R, e) = R$, where $h(u)$ and $g(\hat{R}, u)$ satisfy some regularity
conditions.

Sampling Strategies for Finite Population Using Auxiliary Information

Further, Khare (1993(a)) has proposed a class of estimators for R using multi-auxiliary
characters with unknown means; the class of estimators is given as follows:

$$ R^{***} = g(\hat{R}, u'), \tag{7} $$

such that $g(R, e) = R$, where $u_i' = \frac{\bar{x}_i}{\bar{x}_i'}$, $u' = (u_1', u_2', \ldots, u_p')$, and $\bar{x}_i$ and $\bar{x}_i'$ are the sample means based on $n$
and $n'$ ($> n$) units for the auxiliary characters $x_i$, i = 1, 2, ..., p.

Similarly, Khare (1992) has proposed classes of estimators for P using p auxiliary characters
with known and unknown population means and studied their properties.

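Further, Khare (1990) has proposed a generalized class of estimators for a combination
of product and ratio of some population means using multi-auxiliary characters. The
parametric combination is given by:

$$ \theta = \frac{\bar{Y}_1\,\bar{Y}_2\,\bar{Y}_3 \cdots \bar{Y}_m}{\bar{Y}_{m+1}\,\bar{Y}_{m+2}\,\bar{Y}_{m+3} \cdots \bar{Y}_k} \tag{8} $$

which is the product of the first m population means $\bar{Y}_1, \bar{Y}_2, \bar{Y}_3, \ldots, \bar{Y}_m$ divided by the product of the k-m
population means $\bar{Y}_{m+1}, \bar{Y}_{m+2}, \bar{Y}_{m+3}, \ldots, \bar{Y}_k$. The conventional estimator for $\theta$ is
given by

$$ \hat{\theta} = \frac{\bar{y}_1\,\bar{y}_2\,\bar{y}_3 \cdots \bar{y}_m}{\bar{y}_{m+1}\,\bar{y}_{m+2}\,\bar{y}_{m+3} \cdots \bar{y}_k} \tag{9} $$

It is important to note that for

m = 1, k = 2; $\theta = R$

m = 2, k = 2; $\theta = P$

m = 1, k = 1; $\theta = \bar{Y}_1$

m = k = 2; $\bar{Y}_1 = \bar{Y}_2$, $\theta = \bar{Y}_1^2$,

m = k = 3; $\bar{Y}_1 = \bar{Y}_2 = \bar{Y}_3$, $\theta = \bar{Y}_1^3$,

m = 2, k = 4; $\bar{Y}_1 = \bar{Y}_2$, $\bar{Y}_3 = \bar{Y}_4$, $\theta = \bar{Y}_1^2/\bar{Y}_3^2 = R^2$,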
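
The conventional estimator of $\theta$ replaces each population mean by the corresponding sample mean. A minimal sketch of this plug-in rule follows, assuming the samples for the k study characters are supplied as a list whose first m entries belong to the numerator; the function name and the toy data are hypothetical.

```python
from math import prod
from statistics import mean


def theta_hat(samples, m):
    """Product of the first m sample means divided by the product of the remaining ones."""
    k = len(samples)
    if not 1 <= m <= k:
        raise ValueError("m must satisfy 1 <= m <= k")
    means = [mean(s) for s in samples]
    # prod of an empty list is 1, which covers the case m = k (pure product).
    return prod(means[:m]) / prod(means[m:])


# Toy observations on two study characters (hypothetical values).
y1 = [2, 4]   # sample mean 3
y2 = [1, 3]   # sample mean 2

ratio_est = theta_hat([y1, y2], m=1)    # m = 1, k = 2: estimator of R
product_est = theta_hat([y1, y2], m=2)  # m = 2, k = 2: estimator of P
mean_est = theta_hat([y1], m=1)         # m = 1, k = 1: the sample mean itself
print(ratio_est, product_est, mean_est)
```

The three calls mirror the first three special cases listed above.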

Using $p$ auxiliary characters $x_1, x_2, \ldots, x_p$ with known population means $\bar{X}_1, \bar{X}_2, \ldots, \bar{X}_p$, the class of estimators for $\theta$ is given by:

$\theta^{*} = \hat{\theta}\,h(\mathbf{u})$, (10)

such that $h(\mathbf{e}) = 1$, where $\mathbf{u} = (u_1, u_2, \ldots, u_p)$ and $u_i = \bar{x}_i / \bar{X}_i$, $i = 1, 2, \ldots, p$.

The function $h(u_1, u_2, \ldots, u_p) = h(\mathbf{u})$ satisfies the following regularity conditions:

a) Whatever be the sample chosen, $\mathbf{u}$ assumes values in a bounded closed convex subset $G$ of the $p$-dimensional real space containing the point $\mathbf{u} = \mathbf{e}$.

b) In $G$, the function $h(\mathbf{u})$ is continuous and bounded.

c) The first and second partial derivatives of $h(\mathbf{u})$ exist and are continuous and bounded in $G$.

For two auxiliary variables it is found that the lower bound of the variance of the class of estimators $\theta^{*}$ is the same as that given by the estimators proposed by Singh (1969) and Shah and Shah (1978). Hence it is remarked that the class of estimators $\theta^{*}$ will attain the lower bound for the mean square error if the specified and regularity conditions are satisfied.

Further, Khare (1993b) has proposed a class of two phase sampling estimators for the combination of product and ratio of some population means using multi-auxiliary characters with unknown population means, which is given as follows:

$\theta^{**} = \hat{\theta}\,h(\mathbf{v})$, (11)

where $\mathbf{v} = (v_1, v_2, \ldots, v_p)$ and $v_i = \bar{x}_i / \bar{x}'_i$, $i = 1, 2, \ldots, p$,

such that $h(\mathbf{e}) = 1$ and $h(\mathbf{v})$ satisfies some regularity conditions.

Case 2. Incomplete Response in the Sample due to Non-response: 

In case of non-response on some units selected in the sample, Hansen and Hurwitz 
(1946) have suggested the method of sub sampling from non-respondents and proposed the 
estimator for population mean. Further, Khare et al. (2014) have proposed some new 
estimators in this situation of sub sampling from non-respondents. 

Khare & Pandey (2000) and Khare & Sinha (2010) have proposed classes of estimators for the ratio and product of two population means using an auxiliary character with known population mean in the presence of non-response on the study characters, which are given as follows:

$R^{*} = \hat{R}^{*} h(u_i)$ and $P^{*} = \hat{P}^{*} h(u_i)$, $i = 1, 2$, (12)

such that $h(1) = 1$, where $\hat{R}^{*} = \bar{y}_1^{*} / \bar{y}_2^{*}$, $\hat{P}^{*} = \bar{y}_1^{*}\,\bar{y}_2^{*}$, $u_1 = \bar{x}^{*} / \bar{X}$, $u_2 = \bar{x} / \bar{X}$, and $\bar{y}_1^{*}$, $\bar{y}_2^{*}$ and $\bar{x}^{*}$ are the sample means for the $y_1$, $y_2$ and $x$ characters proposed by Hansen and Hurwitz (1946) based on $n_1 + r$ units, and $\bar{x}$ is the sample mean based on $n$ units. Khare & Sinha (2012) have proposed a combined class of estimators for the ratio and product of two population means in the presence of non-response with known population mean $\bar{X}$. This is a more general class of estimators for $R$ and $P$ under some specified and regularity conditions. Khare et al. (2013(a)) have proposed an improved class of estimators for $R$. In this case, the improved class of estimators for $R$ using an auxiliary character with known population mean $\bar{X}$ in the presence of non-response is given as follows:
$R_i = g(v, u_i)$, $i = 1, 2$, (13)

such that $g(R, 1) = R$, $g_1(R, 1) = 1$ and $g_{12}(R, 1) = R^{-1} g_2(R, 1)$, where $v = \bar{y}_1^{*} / \bar{y}_2^{*}$, $u_1 = \bar{x}^{*} / \bar{X}$ and $u_2 = \bar{x} / \bar{X}$. The function $g(v, u_i)$, $i = 1, 2$, assumes positive values in a region of the real space containing the point $(R, 1)$. The function $g(v, u_i)$ is assumed to be continuous and bounded in this region, and its first and second order partial derivatives exist. The first partial derivatives of $g(v, u_i)$, $i = 1, 2$, at the point $(R, 1)$ with respect to $v$ and $u_i$ are denoted by $g_1(R, 1)$ and $g_2(R, 1)$. The second order partial derivatives of $g(v, u_i)$, $i = 1, 2$, with respect to $v$, to $v$ and $u_i$, and to $u_i$ at the point $(R, 1)$ are denoted by $g_{11}(R, 1)$, $g_{12}(R, 1)$ and $g_{22}(R, 1)$ respectively. Some members of the class of estimators $R_i$ are given as follows:

$C_1 = w_0\,v\,u_i^{\alpha_i}$, $C_2 = w_1 v + w_2 u_i$, $C_3 = w'_1 v + w'_2\,v\,u_i^{\beta_i}$, $i = 1, 2$, (14)

where $w_0$, $w_1$, $w_2$, $w'_1$, $w'_2$, $\alpha_i$ and $\beta_i$, $i = 1, 2$, are constants. Further, the class of estimators proposed by Khare and Sinha (2013) is more efficient than the estimator proposed by Khare and Pandey (2000).

Further, Khare and Sinha (2002(a, b)) have proposed two phase sampling estimators for the ratio and product of two population means in the presence of non-response. Khare and Sinha (2004(a, b)) have proposed a more general class of two phase sampling estimators for $R$ and $P$, which are given as follows:

$T_i = g(v, u_i)$, $i = 1, 2$, (15)

such that $g(R, 1) = R$ and $g_1(R, 1) = 1$, where $v = \bar{y}_1^{*} / \bar{y}_2^{*}$, $u_1 = \bar{x}^{*} / \bar{x}'$, $u_2 = \bar{x} / \bar{x}'$ and $\bar{x}'$ is the sample mean based on $n'$ $(> n)$ units. The function $g(v, u_i)$ satisfies some regularity conditions.

The corresponding class of estimators for $P$ is of the form

$g(w, u_i)$, $i = 1, 2$, (16)

such that $g(P, 1) = P$ and $g_1(P, 1) = 1$, where $w = \bar{y}_1^{*}\,\bar{y}_2^{*}$, $u_1 = \bar{x}^{*} / \bar{x}'$, $u_2 = \bar{x} / \bar{x}'$, and $g(w, u_i)$ satisfies some regularity conditions.

Khare et al. (2012) have proposed two generalized chain type estimators T gl and T g2 for R 
using auxiliary characters in the presence of non-response, which are given as follows: 




where $\hat{R}^{*} = \bar{y}_1^{*} / \bar{y}_2^{*}$ and the remaining symbols are suitable constants. It has been observed that, due to the use of an additional auxiliary character with known population mean along with the main auxiliary character, the proposed classes of estimators $T_{g1}$ and $T_{g2}$ are more efficient than the corresponding generalized estimators for $R$ using the main auxiliary character only, in the case of two phase sampling in the presence of non-response, for fixed sample sizes $(n', n)$ and also for fixed cost $(C \le C_0)$. It is also seen that less cost is incurred for $T_{g1}$ and $T_{g2}$ than for the generalized estimator for $R$ in the case of two phase sampling in the presence of non-response for specified precision $(V = V_0)$.

Further, generalized chain estimators for the ratio and product of two population means have been improved by putting $k_1 \hat{R}$ and $k_2 \hat{P}$ in place of $\hat{R}$ and $\hat{P}$ in the proposed estimators of $R$ and $P$. Further, Khare et al. (2013(b)) have proposed an improved class of chain type estimators for the ratio of two population means using two auxiliary characters in the presence of non-response. The class of estimators is given as follows:

$R_{ci} = f(\hat{R}^{*}, u_i, v)$, $i = 1, 2$, (18)

such that $f(R, 1, 1) = R$ and $f_1(R, 1, 1) = 1$, where $\hat{R}^{*} = \bar{y}_1^{*} / \bar{y}_2^{*}$, $u_1 = \bar{x}^{*} / \bar{x}'$, $u_2 = \bar{x} / \bar{x}'$ and $v = \bar{z}' / \bar{Z}$. The function $f(\hat{R}^{*}, u_i, v)$, $i = 1, 2$, satisfies some regularity conditions.

Khare and Sinha (2007) have proposed estimators for $R$ using multi-auxiliary characters with known population means in the presence of non-response. The class of estimators $t_i$ is given as follows:

$t_i = \hat{R}^{*} g_i(\mathbf{u}_i)$, $i = 1, 2$, (19)

such that $g_i(\mathbf{e}) = 1$, where $\mathbf{u}_i$ and $\mathbf{e}$ denote the column vectors $(u_{i1}, u_{i2}, \ldots, u_{ip})'$ and $(1, 1, \ldots, 1)'$, $u_{1j} = \bar{x}_j^{*} / \bar{X}_j$ and $u_{2j} = \bar{x}_j / \bar{X}_j$, $j = 1, 2, \ldots, p$.

An improved class of estimators for $R$ using multi-auxiliary variables under a double sampling scheme in the presence of non-response has been proposed by Khare and Sinha (2012), who also studied its properties.

Khare and Sinha (2014) have extended the class of estimators proposed by Khare and Sinha (2012) and proposed a wider class of two phase sampling estimators for $R$ using multi-auxiliary characters in the presence of non-response.
Abstract

In this paper, the problem of estimating the population mean using the coefficient of variation and the shape parameters of each stratum is considered. The expression for the mean squared error of the proposed estimator is derived and its properties are discussed.

Keywords Auxiliary information, MSE, coefficient of variation, stratum, 
shape parameter. 



Introduction 

Prior information about population parameters such as the coefficient of variation, mean, skewness and kurtosis is very useful in the estimation of the population parameter of the study character. In agricultural and biological studies, information about the coefficient of variation and the shape parameters is often available. If these parameters remain essentially unchanged over time, then knowledge of them may profitably be used to produce optimum estimates of the parameters (Sen and Gerig (1975)). Searls (1964, 67) and Hirano (1972) have proposed the use of the coefficient of variation in the estimation of the population mean. Searls and Intarapanich (1990) have suggested the use of kurtosis in the estimation of variance. Sen (1978) has proposed an estimator for the population mean using the known value of the coefficient of variation.

In stratified random sampling, the theory has been developed to provide the optimum estimator $T_1$ of the population mean based on the sample mean from each stratum. We extend it by constructing an estimator $T_2$ using the coefficient of variation $C_i$ and the shape parameters $\beta_{1i}$, $\beta_{2i}$ $(i = 1, 2, \ldots, h)$ from each stratum and discuss its usefulness. We also define estimators $T_3$ and $T_4$ for the cases when the coefficients of variation are unknown but the shape parameters are known, and when neither the coefficients of variation nor the shape parameters are known.

Estimators and their Mean Square Error 

Let $N_i$ denote the size of the $i$th stratum, $n_i$ the size of the sample to be selected from the $i$th stratum, and $h$ the number of strata, with

$\sum_{i=1}^{h} N_i = N$ and $\sum_{i=1}^{h} n_i = n$, (1)

where $N$ and $n$ denote the number of units in the population and sample respectively.

Let $y_{ij}$ be the $j$th unit of the $i$th stratum. Then the population mean $\bar{Y}_N$ can be expressed as

$\bar{Y}_N = \sum_{i=1}^{h} p_i \bar{Y}_i$, (2)

where $p_i = N_i / N$ and $\bar{Y}_i$ is the population mean for the $i$th stratum.

Let $n_i$ units be selected from the $i$th stratum and let the corresponding sample mean and sample variance be denoted by $\bar{y}_i$ and $s_i^2$ respectively. Then the estimate of $\bar{Y}_N$ is given by

$T_1 = \sum_{i=1}^{h} p_i \bar{y}_i$

and the variance of $T_1$ is

$V(T_1) = \sum_{i=1}^{h} \dfrac{p_i^2 \sigma_i^2}{n_i}$ (if f.p.c. is ignored),

where $\sigma_i^2$ is the population variance of $y$ in the $i$th stratum.
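The estimator $T_1$ and its variance can be illustrated numerically. The sketch below is hypothetical in its names and data; in particular it assumes that the unknown $\sigma_i^2$ is replaced by the sample variance $s_i^2$, which gives an estimate of $V(T_1)$ rather than $V(T_1)$ itself.

```python
def stratified_mean(stratum_sizes, samples):
    """Return (T1, estimated V(T1)) with the f.p.c. ignored.

    stratum_sizes: list of N_i; samples: list of lists of observed y values,
    one list per stratum (each with at least two observations).
    """
    N = sum(stratum_sizes)
    t1 = 0.0
    var_t1 = 0.0
    for N_i, y in zip(stratum_sizes, samples):
        n_i = len(y)
        p_i = N_i / N                      # stratum weight p_i = N_i / N
        y_bar = sum(y) / n_i               # stratum sample mean
        # Sample variance s_i^2, used here in place of sigma_i^2 (assumption).
        s2 = sum((v - y_bar) ** 2 for v in y) / (n_i - 1)
        t1 += p_i * y_bar
        var_t1 += p_i ** 2 * s2 / n_i
    return t1, var_t1


# Hypothetical two-stratum example.
t1, var_t1 = stratified_mean([60, 40], [[1.0, 2.0, 3.0], [10.0, 12.0]])
```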

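Case 1: Coefficient of variation and the shape parameters are known.

We define

$T_2 = \sum_{i=1}^{h} p_i \{\alpha_i \bar{y}_i + (1 - \alpha_i)\,C_i^{-1} \sqrt{s_i^2}\}$

and the expectation of $T_2$ is given by

$E(T_2) = \sum_{i=1}^{h} p_i \left\{\alpha_i \bar{Y}_i + (1 - \alpha_i)\,\bar{Y}_i \left(1 - \dfrac{V(s_i^2)}{8\sigma_i^4}\right)\right\}$

$= \sum_{i=1}^{h} p_i \left\{\bar{Y}_i - (1 - \alpha_i)\,\bar{Y}_i\,\dfrac{V(s_i^2)}{8\sigma_i^4}\right\}$

$= \sum_{i=1}^{h} p_i \left\{\bar{Y}_i - \dfrac{(1 - \alpha_i)\,\bar{Y}_i}{8} \left[\dfrac{\beta_{2i}}{n_i} - \dfrac{n_i - 3}{n_i(n_i - 1)}\right]\right\}$

$= \bar{Y}_N + O\!\left(\dfrac{1}{n}\right)$.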
$MSE(T_2)_{\min} = \sum_{i=1}^{h} \dfrac{p_i^2 \sigma_i^2}{n_i} \left\{\dfrac{\beta_{2i} - \beta_{1i} - 1}{(\beta_{2i} - \beta_{1i} - 1) + (\sqrt{\beta_{1i}} - 2C_i)^2}\right\} + O\!\left(\dfrac{1}{n^{3/2}}\right)$. (9)

The value of $\alpha_{i\,opt}$ will be less than one for $\sqrt{\beta_{1i}} < 2C_i$, which implies that the distribution is near normal, Poisson, negative binomial or Neyman type I. The value of $\alpha_{i\,opt}$ will be equal to one for $\sqrt{\beta_{1i}} = 2C_i$, which is true for the gamma and exponential distributions. The value of $\alpha_{i\,opt}$ will be greater than one for $\sqrt{\beta_{1i}} > 2C_i$, which is likely for the lognormal or inverse Gaussian distribution. It is easy to see that $T_2$ will always be more efficient than $T_1$ if $\sqrt{\beta_{1i}} < 2C_i$ or $\sqrt{\beta_{1i}} > 2C_i$, justifying the use of $T_2$ in the case of the near normal, Poisson, negative binomial, Neyman type I, lognormal or inverse Gaussian distributions. $T_2$ is as efficient as $T_1$ if $\sqrt{\beta_{1i}} = 2C_i$, so for the gamma or exponential distribution one may use either $T_1$ or $T_2$. This shows that the proposed estimator $T_2$ is never less efficient than the estimator $T_1$, though a comparatively higher efficiency may be seen for the near normal, Poisson and negative binomial distributions than for the lognormal or inverse Gaussian distribution.
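The three regimes described above can be checked numerically for a single stratum. The sketch below is illustrative only: the function names are hypothetical, and the expression used for $\alpha_{i\,opt}$, namely $(\beta_{2i} - 2C_i\sqrt{\beta_{1i}} - 1)/(4C_i^2 - 4C_i\sqrt{\beta_{1i}} + \beta_{2i} - 1)$, together with the leading-order ratio $MSE(T_2)_{\min}/V(T_1)$ taken from (9), are assumptions read off from this section's formulas.

```python
from math import sqrt


def alpha_opt(beta1, beta2, cv):
    """Assumed optimum weight alpha_i for one stratum."""
    numerator = beta2 - 2.0 * cv * sqrt(beta1) - 1.0
    denominator = 4.0 * cv ** 2 - 4.0 * cv * sqrt(beta1) + beta2 - 1.0
    return numerator / denominator


def mse_ratio(beta1, beta2, cv):
    """Leading-order ratio MSE(T2)_min / V(T1) within one stratum."""
    core = beta2 - beta1 - 1.0
    return core / (core + (sqrt(beta1) - 2.0 * cv) ** 2)


# Exponential distribution: C = 1, beta1 = 4, beta2 = 9, so sqrt(beta1) = 2C.
exp_alpha, exp_ratio = alpha_opt(4.0, 9.0, 1.0), mse_ratio(4.0, 9.0, 1.0)

# Normal distribution with an assumed C = 0.5: beta1 = 0, beta2 = 3.
norm_alpha, norm_ratio = alpha_opt(0.0, 3.0, 0.5), mse_ratio(0.0, 3.0, 0.5)

# A hypothetical heavily skewed case with sqrt(beta1) > 2C.
skew_alpha, skew_ratio = alpha_opt(9.0, 20.0, 1.0), mse_ratio(9.0, 20.0, 1.0)
```

For the exponential case the weight equals one and the ratio equals one, so $T_2$ reduces to $T_1$; in the other two cases the ratio falls below one.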
Case 2: $C_i$'s are unknown, $\beta_{1i}$'s and $\beta_{2i}$'s are known.

When the $C_i$'s are unknown, we use their estimates $c_i$ based on a larger sample of size $n'_i$ from a previous occasion. Now we define an estimator $T_3$ for $\bar{Y}_N$ given by

$T_3 = \sum_{i=1}^{h} p_i \{\alpha'_i \bar{y}_i + (1 - \alpha'_i)\,c_i^{-1} \sqrt{s_i^2}\}$.

The mean square error of the estimator $T_3$ is given by



mse(t 2 /«;) = i [«; 2 + «; (i - )cr' #i7 + a - «; ) 2 cr 2 { (A 2; -i) + 4/qcr 2 v( C/ )} , 
(=1 



where V(c , ) = % { (/?,, - 1) - } = %(&, - 1) . 

n , M2Ml //] «/ 



The optimum value of a- is given by 



(^2/-A,--l) + 4w,-Cr 2 V(c f ) 

(A, -A/ - 1 + 4/7, cy 2 y (c, » + ( Va7 - 2Q ) : 



It is easy to see that

$MSE(T_3 \mid \alpha'_i)_{\min} = \sum_{i=1}^{h} \dfrac{p_i^2 \sigma_i^2}{n_i}\,\dfrac{(\beta_{2i} - \beta_{1i} - 1) + 4 n_i C_i^{-2} V(c_i)}{(\beta_{2i} - \beta_{1i} - 1 + 4 n_i C_i^{-2} V(c_i)) + (\sqrt{\beta_{1i}} - 2C_i)^2}$. (13)

It may be remarked that (13) differs from (9) by the single term $4 n_i C_i^{-2} V(c_i)$ in both the numerator and the denominator. The nature of the estimator $T_3$ is similar to that of $T_2$, and its MSE will converge to $MSE(T_2)$ as $n_i V(c_i) / C_i^2 \to 0$.

Case 3: $C_i$'s, $\beta_{1i}$'s and $\beta_{2i}$'s are unknown:

When the $C_i$'s, $\beta_{1i}$'s and $\beta_{2i}$'s are not known, they can be estimated on the basis of a larger sample of size $n'_i$ $(n'_i \gg n_i)$ from past data, and we may have the estimator for the population mean $\bar{Y}_N$ given by

$T_4 = \sum_{i=1}^{h} p_i \{\hat{\alpha}_i \bar{y}_i + (1 - \hat{\alpha}_i)\,c_i^{-1} \sqrt{s_i^2}\}$,

where $\hat{\alpha}_i$ is obtained by replacing the unknown parameters by their estimates in

$\alpha_{i\,opt} = \dfrac{\beta_{2i} - 2C_i\sqrt{\beta_{1i}} - 1}{4C_i^2 - 4C_i\sqrt{\beta_{1i}} + \beta_{2i} - 1}$.

It is easy to see that $MSE(T_4)$ will be the same as $MSE(T_3)$, because after estimating the unknown parameters in the constant $\alpha_{i\,opt}$ the MSE remains unchanged up to the terms of $O(n^{-1})$ (Srivastava and Jhajj (1983)).
Abstract

In this paper, an improved chain ratio-cum-regression type estimator for the population mean in the presence of non-response is studied for fixed cost and for specified precision. The theoretical results are supported by a numerical illustration.

Keywords Simple random sampling, non response, fixed cost, precision. 

Introduction 

In socio-economic and agricultural research, problems arise from non-response, which frequently occurs due to not-at-home cases, lack of interest, call backs, etc. In this situation a procedure of sub-sampling from non-respondents was suggested by Hansen and Hurwitz (1946). The use of auxiliary information in the estimators of population parameters has helped to increase the efficiency of the proposed estimators. Using an auxiliary character with known population mean, estimators have been proposed by Rao (1986, 90) and Khare and Srivastava (1996, 1997). Further, Khare and Srivastava (1993, 1995), Khare et al. (2008), Singh and Kumar (2010), Khare and Kumar (2009) and Khare and Srivastava (2010) have proposed different types of estimators for the estimation of the population mean in the presence of non-response when the population mean of the auxiliary character is unknown.

In the present paper, we have studied the improved chain ratio-cum-regression type estimator for the population mean in the presence of non-response proposed by Khare and Rehman (2014), in the case of fixed cost and of specified precision. We have obtained the optimum sizes of the first phase sample ($n'$) and the second phase sample ($n$), drawn from the population of size $N$ by the SRSWOR method of sampling, in the case of fixed cost and also in the case of specified precision $V = V_0$. The expression for the minimum MSE of the estimator has been obtained for the optimum values of $n'$ and $n$ in the case of fixed cost $C \le C_0$. The expression for the minimum cost for the estimator has also been obtained in the case of specified precision $V = V_0$. An empirical study has been carried out to observe the properties of the estimator in the case of fixed cost and also in the case of specified precision.



The Estimators 

Let $\bar{Y}$, $\bar{X}$ and $\bar{Z}$ denote the population means of the study character $y$, the auxiliary character $x$ and the additional auxiliary character $z$, having $j$th values $Y_j$, $X_j$ and $Z_j$, $j = 1, 2, 3, \ldots, N$.

Suppose the population of size $N$ is divided into $N_1$ responding units and $N_2$ non-responding units. According to Hansen and Hurwitz, a sample of size $n$ is taken from the population of size $N$ by using the simple random sampling without replacement (SRSWOR) scheme, and it is observed that $n_1$ units respond and $n_2$ units do not respond. Then, by making extra effort, a sub-sample of size $r\,(= n_2 k^{-1})$ is drawn from the $n_2$ non-responding units and information on the study character $y$ is collected on these $r$ units. Hence the estimator for $\bar{Y}$ based on $n_1 + r$ units on the study character $y$ is given by:

$\bar{y}^{*} = \dfrac{n_1}{n}\,\bar{y}_1 + \dfrac{n_2}{n}\,\bar{y}'_2$,

where $n_1$ and $n_2$ are the numbers of responding and non-responding units in a sample of size $n$ selected from the population of size $N$ by SRSWOR, and $\bar{y}_1$ and $\bar{y}'_2$ are the means based on the $n_1$ responding units and on the $r$ units sub-sampled from the non-responding units by SRSWOR.



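Similarly, we can also define the estimator for the population mean $\bar{X}$ of the auxiliary character $x$ based on $n_1$ and $r$ units, which is given as:

$\bar{x}^{*} = \dfrac{n_1}{n}\,\bar{x}_1 + \dfrac{n_2}{n}\,\bar{x}'_2$.

The variances of the estimators $\bar{y}^{*}$ and $\bar{x}^{*}$ are given by

$V(\bar{y}^{*}) = \dfrac{f}{n}\,S_y^2 + \dfrac{W_2(k - 1)}{n}\,S_{y(2)}^2$,

$V(\bar{x}^{*}) = \dfrac{f}{n}\,S_x^2 + \dfrac{W_2(k - 1)}{n}\,S_{x(2)}^2$,

where $f = 1 - \dfrac{n}{N}$, $W_2 = \dfrac{N_2}{N}$, and $(S_y^2, S_{y(2)}^2)$ and $(S_x^2, S_{x(2)}^2)$ are the population mean squares of $y$ and $x$ for the entire population and for the non-responding part of the population.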
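The sub-sampling estimator and its variance can be sketched in a few lines. The function names and the numbers below are hypothetical and only illustrate the formulas $\bar{y}^{*} = (n_1/n)\bar{y}_1 + (n_2/n)\bar{y}'_2$ and $V(\bar{y}^{*}) = (f/n) S_y^2 + W_2(k-1) S_{y(2)}^2 / n$; the population mean squares are assumed known for the variance calculation.

```python
def hansen_hurwitz_mean(respondents, subsampled_nonrespondents, n2):
    """Estimate the population mean from n1 respondents and a sub-sample of
    r units drawn from the n2 non-respondents."""
    n1 = len(respondents)
    n = n1 + n2
    y_bar_1 = sum(respondents) / n1
    y_bar_2 = sum(subsampled_nonrespondents) / len(subsampled_nonrespondents)
    return (n1 * y_bar_1 + n2 * y_bar_2) / n


def hansen_hurwitz_variance(S2, S2_nonresp, n, N, W2, k):
    """Variance of the estimator: (f/n) S^2 + W2 (k - 1) S_(2)^2 / n."""
    f = 1.0 - n / N
    return f / n * S2 + W2 * (k - 1.0) / n * S2_nonresp


# Hypothetical data: 3 respondents, 4 non-respondents of whom r = 2 are
# sub-sampled (k = 2).
y_star = hansen_hurwitz_mean([10.0, 12.0, 14.0], [20.0, 22.0], n2=4)
v_star = hansen_hurwitz_variance(S2=4.0, S2_nonresp=1.0, n=20, N=100,
                                 W2=0.25, k=2.0)
```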

In case when the population mean of the auxiliary character is unknown, we select a larger first phase sample of size $n'$ units from a population of size $N$ units by using simple random sampling without replacement (SRSWOR) and estimate $\bar{X}$ by $\bar{x}'$ based on these $n'$ units. Further, a second phase sample of size $n$ (i.e. $n < n'$) is drawn from the $n'$ units by using SRSWOR, and the variable $y$ under investigation is measured on $n_1$ responding and $n_2$ non-responding units. Again, a sub-sample of size $r\,(= n_2/k,\ k > 1)$ is drawn from the $n_2$ non-responding units and information is collected on the $r$ units by personal interview.
In this case, two phase sampling ratio, product and regression estimators for the population mean $\bar{Y}$ using one auxiliary character in the presence of non-response have been proposed by Khare and Srivastava (1993, 1995), which are given as follows:

$T_1 = \bar{y}^{*}\,\dfrac{\bar{x}'}{\bar{x}^{*}}$,

$T_2 = \bar{y}^{*} + b^{*}(\bar{x}' - \bar{x}^{*})$,

where $\bar{x}^{*} = \dfrac{n_1}{n}\,\bar{x}_1 + \dfrac{n_2}{n}\,\bar{x}'_2$, $\bar{x} = \dfrac{1}{n}\sum_{j=1}^{n} x_j$, $\bar{x}' = \dfrac{1}{n'}\sum_{j=1}^{n'} x_j$, $b^{*} = \dfrac{\hat{S}_{yx}}{\hat{S}_x^2}$, $s_x^2 = \dfrac{1}{n - 1}\sum_{j=1}^{n}(x_j - \bar{x})^2$,

and $\hat{S}_{yx}$ and $\hat{S}_x^2$ are estimates of $S_{yx}$ and $S_x^2$ based on $n_1 + r$ units.

The conventional and alternative two phase sampling ratio type estimators suggested by Khare and Srivastava (2010) are as follows:



rr -* X 

t 3 =y — . 

X 



JxY 



T 4=y - 
\x 



where 



a and a' are constants. 



Singh and Kumar (2010) have proposed difference type estimator using auxiliary 
character in the presence of non-response which is given as follows: 




where c^and a 2 are constants. 

In case when $\bar{X}$ is not known, we may use an additional auxiliary character $z$ with known population mean $\bar{Z}$, with the assumption that the variable $z$ is less correlated with $y$ than $x$ is, i.e. $\rho_{yz} < \rho_{yx}$, and that $x$ and $z$ are variables such that $z$ is cheaper to observe than $x$.

Following Chand (1975), some estimators have been proposed by Kiregyera (1980, 84), Srivasatava et al. (1990) and Khare & Kumar (2011). In the case of non-response on the study character, the chain regression type and generalized chain type estimators for the population mean in the presence of non-response have been proposed by Khare & Kumar (2010) and Khare et al. (2011). An improved chain ratio-cum-regression type estimator for the population mean in the presence of non-response has been proposed by Khare & Rehman (2014), which is given as follows:

$T_6 = \bar{y}^{*} \left(\dfrac{\bar{x}'}{\bar{x}^{*}}\right)^{p} \left(\dfrac{\bar{Z}}{\bar{z}'}\right)^{q} + b_{yx}(\bar{x}' - \bar{x}^{*}) + b_{xz}(\bar{Z} - \bar{z}')$, (9)

where $p$ and $q$ are constants, $b_{yx}$ and $b_{xz}$ are regression coefficients, and $\bar{Z}$ and $\bar{z}'$ are the population mean and the sample mean of $z$, the latter based on the first phase sample of size $n'$ units selected from the population of size $N$ by the SRSWOR method.
Mean Square Errors of the Study Estimator

Using large sample approximations, the expression for the mean square error of the estimator proposed by Khare & Rehman (2014), up to the terms of order $n^{-1}$, is given by

$MSE(T_6) = V(\bar{y}^{*}) + \left(\dfrac{1}{n} - \dfrac{1}{n'}\right)\{\bar{Y}^2 p^2 C_x^2 + b_{yx}^2 \bar{X}^2 C_x^2 - 2\bar{Y}^2 p\,C_{yx} - 2\bar{X}\bar{Y} b_{yx} C_{yx} + 2\bar{X}\bar{Y} b_{yx}\,p\,C_x^2\}$

$+ \left(\dfrac{1}{n'} - \dfrac{1}{N}\right)\{\bar{Y}^2 q^2 C_z^2 + b_{xz}^2 \bar{Z}^2 C_z^2 - 2\bar{Y}^2 q\,C_{yz} - 2\bar{Y}\bar{Z} b_{xz} C_{yz} + 2\bar{Y}\bar{Z}\,q\,b_{xz} C_z^2\}$

$+ \dfrac{W_2(k - 1)}{n}\{\bar{Y}^2 p^2 C_{x(2)}^2 + b_{yx}^2 \bar{X}^2 C_{x(2)}^2 - 2\bar{Y}^2 p\,C_{yx(2)} - 2\bar{X}\bar{Y} b_{yx} C_{yx(2)} + 2\bar{X}\bar{Y} b_{yx}\,p\,C_{x(2)}^2\}$. (10)



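The optimum values of $p$ and $q$ and the values of the regression coefficients are given as follows:

$p_{opt} = \dfrac{\left(\dfrac{1}{n} - \dfrac{1}{n'}\right)(\bar{Y} C_{yx} - \bar{X} b_{yx} C_x^2) + \dfrac{W_2(k - 1)}{n}(\bar{Y} C_{yx(2)} - \bar{X} b_{yx} C_{x(2)}^2)}{\bar{Y}\left\{\left(\dfrac{1}{n} - \dfrac{1}{n'}\right) C_x^2 + \dfrac{W_2(k - 1)}{n}\,C_{x(2)}^2\right\}}$,

$q_{opt} = \dfrac{\bar{Y} C_{yz} - \bar{Z} b_{xz} C_z^2}{\bar{Y} C_z^2}$,

$b_{yx} = \dfrac{\bar{Y} \rho_{yx} C_y}{\bar{X} C_x}$ and $b_{xz} = \dfrac{\bar{X} \rho_{xz} C_x}{\bar{Z} C_z}$.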



Mean square errors of the estimators $T_1$, $T_2$, $T_3$, $T_4$ and $T_5$ are given as follows:

$MSE(T_1) = V(\bar{y}^{*}) + \bar{Y}^2 \left[\left(\dfrac{1}{n} - \dfrac{1}{n'}\right)(C_x^2 - 2C_{yx}) + \dfrac{W_2(k - 1)}{n}(C_{x(2)}^2 - 2C_{yx(2)})\right]$, (14)



* —9 ( 1 1 ) 9 9 Wo(k — 1) (99 

MSE(T 2)m - in =V(y )-Y 2 \---\p 2 x C 2 + j fi 2 C_J (2) - 2B C yxa) 



MSE(T 3 ) min = V(y )-Y- 



1 1 V w 2 (k- 1) 

77 ,7'J W ,7 W(2) 



«W 4 )„„ = V(f ) - r ! ( i - Ike; 

U n J ' 



-j f' f 1 lh , , WJk-l) , , 

MSE(T 5 ) min =Y 2 J -C]+ 7 (1 -p 2 yx )C; + ^ ^(1-^ ( 2 ) )C 2 (2) 

Tl \YlYl) Yl 



where $V(\bar{y}^{*}) = \bar{Y}^2 \left[\dfrac{f}{n}\,C_y^2 + \dfrac{W_2(k - 1)}{n}\,C_{y(2)}^2\right]$ and $B = \dfrac{\bar{Y} \rho_{yx} C_y}{\bar{X} C_x}$.
Determination of $n'$, $n$ and $k$ for the Fixed Cost $C \le C_0$

Let us assume that $C_0$ is the total cost (fixed) of the survey apart from overhead cost. The expected total cost of the survey apart from overhead cost is given as follows:

$C = (e'_1 + e'_2)\,n' + n\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k}\right)$, (19)

where

$e'_1$: the cost per unit of obtaining information on the auxiliary character $x$ at the first phase.

$e'_2$: the cost per unit of obtaining information on the additional auxiliary character $z$ at the first phase.

$e_1$: the cost per unit of mailing the questionnaire or visiting the unit at the second phase.

$e_2$: the cost per unit of collecting and processing data obtained from responding units.

$e_3$: the cost per unit of obtaining and processing data (after extra efforts) for the sub-sampled units.

The expression for $MSE(T_6)$ can be expressed in terms of $D_0$, $D_1$, $D_2$ and $D_3$, which are the coefficients of $\dfrac{1}{n}$, $\dfrac{1}{n'}$, $\dfrac{k}{n}$ and $\dfrac{1}{N}$ respectively. The expression of $MSE(T_6)$ is given as follows:

$MSE(T_6) = \dfrac{D_0}{n} + \dfrac{D_1}{n'} + \dfrac{k D_2}{n} - \dfrac{D_3}{N}$. (20)

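For obtaining the optimum values of $n'$, $n$, $k$ for the fixed cost $C \le C_0$, we define a function $\phi$ which is given as:

$\phi = MSE(T_6) + \lambda(C - C_0)$, (21)

where $\lambda$ is the Lagrange multiplier.

Differentiating $\phi$ with respect to $n'$, $n$, $k$ and equating to zero, we get the optimum values of $n'$, $n$ and $k$, which are given as follows:

$n'_{opt} = \sqrt{\dfrac{D_1}{\lambda(e'_1 + e'_2)}}$, (22)

$n_{opt} = \sqrt{\dfrac{D_0 + k_{opt} D_2}{\lambda\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k_{opt}}\right)}}$, (23)

$k_{opt} = \sqrt{\dfrac{D_0 W_2 e_3}{D_2 (e_1 + e_2 W_1)}}$, (24)

where $\sqrt{\lambda} = \dfrac{1}{C_0}\left[\sqrt{D_1 (e'_1 + e'_2)} + \sqrt{(D_0 + k_{opt} D_2)\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k_{opt}}\right)}\right]$. (25)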
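The fixed-cost allocation can be sketched numerically. The code below is a minimal illustration under stated assumptions: it takes the cost function $C = (e'_1 + e'_2) n' + n(e_1 + e_2 W_1 + e_3 W_2 / k)$ and the form $D_0/n + D_1/n' + k D_2/n$ for the part of $MSE(T_6)$ that depends on the sample sizes (the $1/N$ term is ignored), and applies the standard Lagrange-multiplier solution. The function name and the values of $D_0$, $D_1$, $D_2$ are hypothetical.

```python
from math import sqrt


def optimum_fixed_cost(D0, D1, D2, C0, e1p, e2p, e1, e2, e3, W1, W2):
    """Return (k_opt, n'_opt, n_opt, minimum MSE without the 1/N term)."""
    k = sqrt(D0 * W2 * e3 / (D2 * (e1 + e2 * W1)))
    a = e1p + e2p                      # cost per first phase unit
    b = e1 + e2 * W1 + e3 * W2 / k     # expected cost per second phase unit
    A = D0 + k * D2                    # coefficient of 1/n in the MSE
    root = sqrt(D1 * a) + sqrt(A * b)  # equals C0 * sqrt(lambda)
    n_first = C0 * sqrt(D1 / a) / root
    n_second = C0 * sqrt(A / b) / root
    mse_min = root ** 2 / C0
    return k, n_first, n_second, mse_min


# Hypothetical D-values; the unit costs follow the pattern of the empirical
# study (Rs. 0.90, 0.10, 2, 4, 25) with C0 = Rs. 220.
k_opt, n1_opt, n2_opt, mse_min = optimum_fixed_cost(
    D0=4.0, D1=1.0, D2=1.0, C0=220.0,
    e1p=0.9, e2p=0.1, e1=2.0, e2=4.0, e3=25.0, W1=0.75, W2=0.25)
```

At the returned values the budget is exhausted exactly, and the MSE evaluated at $(n'_{opt}, n_{opt}, k_{opt})$ agrees with the closed-form minimum.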
Substituting the optimum values of $n'$, $n$ and $k$ in the expression for $MSE(T_6)$, we get the minimum value of $MSE(T_6)$:

$MSE(T_6)_{\min} = \dfrac{1}{C_0}\left[\sqrt{D_1 (e'_1 + e'_2)} + \sqrt{(D_0 + k_{opt} D_2)\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k_{opt}}\right)}\right]^2 - \dfrac{D_3}{N}$. (26)

Now neglecting the term of $O(N^{-1})$, we have

$MSE(T_6)_{\min} = \dfrac{1}{C_0}\left[\sqrt{D_1 (e'_1 + e'_2)} + \sqrt{(D_0 + k_{opt} D_2)\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k_{opt}}\right)}\right]^2$. (27)

Determination of $n'$, $n$ and $k$ for the Specified Precision $V = V'_0$

Let $V'_0$ be the specified variance of the estimator $T_6$, which is fixed in advance, so we have

$V'_0 = \dfrac{D_0}{n} + \dfrac{D_1}{n'} + \dfrac{k D_2}{n} - \dfrac{D_3}{N}$. (28)

To find the optimum values of $n'$, $n$, $k$ and the minimum expected total cost, we define a function $\psi$ which is given as follows:

$\psi = (e'_1 + e'_2)\,n' + n\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k}\right) + \mu\,(MSE(T_6) - V'_0)$, (29)

where $\mu$ is the Lagrange multiplier.

After differentiating $\psi$ with respect to $n'$, $n$, $k$ and equating to zero, we find the optimum values of $n'$, $n$ and $k$, which are given as:

$n'_{opt} = \sqrt{\dfrac{\mu D_1}{e'_1 + e'_2}}$, (30)

$n_{opt} = \sqrt{\dfrac{\mu (D_0 + k_{opt} D_2)}{e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k_{opt}}}}$, (31)

$k_{opt} = \sqrt{\dfrac{D_0 W_2 e_3}{D_2 (e_1 + e_2 W_1)}}$, (32)

where

$\sqrt{\mu} = \dfrac{\sqrt{D_1 (e'_1 + e'_2)} + \sqrt{(D_0 + k_{opt} D_2)\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k_{opt}}\right)}}{V'_0 + \dfrac{D_3}{N}}$. (33)

The minimum expected total cost incurred on the use of $T_6$ for the specified variance $V'_0$ will be given as follows:

$C_{0\min} = \dfrac{\left[\sqrt{D_1 (e'_1 + e'_2)} + \sqrt{(D_0 + k_{opt} D_2)\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k_{opt}}\right)}\right]^2}{V'_0 + \dfrac{D_3}{N}}$. (34)

Now neglecting the terms of $O(N^{-1})$, we have

$C_{0\min} = \dfrac{\left[\sqrt{D_1 (e'_1 + e'_2)} + \sqrt{(D_0 + k_{opt} D_2)\left(e_1 + e_2 W_1 + e_3 \dfrac{W_2}{k_{opt}}\right)}\right]^2}{V'_0}$. (35)
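The specified-precision allocation mirrors the fixed-cost one and can be sketched the same way. As before, this is only an illustration under assumptions: terms of order $1/N$ are neglected, the function name is hypothetical, and the values of $D_0$, $D_1$, $D_2$ and of the target variance are made up for the example.

```python
from math import sqrt


def optimum_specified_precision(D0, D1, D2, V0, e1p, e2p, e1, e2, e3, W1, W2):
    """Return (k_opt, n'_opt, n_opt, minimum expected cost), ignoring 1/N terms."""
    k = sqrt(D0 * W2 * e3 / (D2 * (e1 + e2 * W1)))
    a = e1p + e2p                      # cost per first phase unit
    b = e1 + e2 * W1 + e3 * W2 / k     # expected cost per second phase unit
    A = D0 + k * D2                    # coefficient of 1/n in the MSE
    root = sqrt(D1 * a) + sqrt(A * b)  # equals V0 * sqrt(mu)
    n_first = sqrt(D1 / a) * root / V0
    n_second = sqrt(A / b) * root / V0
    cost_min = root ** 2 / V0
    return k, n_first, n_second, cost_min


# Hypothetical inputs for illustration only.
k_sp, n1_sp, n2_sp, cost_min = optimum_specified_precision(
    D0=4.0, D1=1.0, D2=1.0, V0=0.25,
    e1p=0.9, e2p=0.1, e1=2.0, e2=5.0, e3=25.0, W1=0.75, W2=0.25)
```

The returned sample sizes reproduce the target variance exactly, and the cost evaluated at those sizes equals the closed-form minimum.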



An Empirical Study

To illustrate the results we use the data considered by Khare and Sinha (2007). The description of the population is given below:

The data on physical growth of the upper socio-economic group of 95 schoolchildren of Varanasi, collected under an ICMR study by the Department of Pediatrics, B.H.U., during 1983-84, have been taken under study. The first 25% (i.e. 24 children) of the units have been considered as non-responding units. The study variable ($y$), the auxiliary variable ($x$) and the additional auxiliary variable ($z$) are taken as follows:

$y$: weight (in kg) of the children.

$x$: skull circumference (in cm) of the children.

$z$: chest circumference (in cm) of the children.

The values of the parameters of the $y$, $x$ and $z$ characters for the given data are as follows:

$\bar{Y} = 19.4968$, $\bar{Z} = 51.1726$, $\bar{X} = 55.8611$, $C_y = 0.15613$, $C_z = 0.03006$, $C_x = 0.05860$, $C_{y(2)} = 0.12075$, $C_{z(2)} = 0.02478$, $C_{x(2)} = 0.05402$, $\rho_{yz} = 0.328$, $\rho_{yx} = 0.846$, $\rho_{xz} = 0.297$, $\rho_{xz(2)} = 0.570$, $W_2 = 0.25$, $W_1 = 0.74$, $N = 95$, $n = 35$.

Table 1. Relative efficiency (in %) of the estimators with respect to $\bar{y}^{*}$ (for the fixed cost $C \le C_0$ = Rs. 220, $e'_1$ = Rs. 0.90, $e'_2$ = Rs. 0.10, $e_1$ = Rs. 2, $e_2$ = Rs. 4, $e_3$ = Rs. 25).

Estimators       k_opt    n'_opt   n_opt    Efficiency
$\bar{y}^{*}$    2.68     -        30       100 (0.3843)*
$T_1$            2.89     58       23       117 (0.3272)
$T_2$            2.03     74       19       131 (0.2941)
$T_3$            2.61     81       20       155 (0.2473)
$T_4$            1.06     76       14       136 (0.2819)
$T_5$            2.68     81       20       157 (0.2453)
$T_6$            2.67     68       21       166 (0.2315)

*Figures in parentheses give the MSE(.).

From Table 1, we observe that for the fixed cost $C \le C_0$ the study estimator $T_6$ is more efficient than the estimators $\bar{y}^{*}$, $T_1$, $T_2$, $T_3$, $T_4$ and $T_5$.
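The efficiency column of Table 1 can be reproduced from the MSE values reported in parentheses, taking the relative efficiency as $100 \times MSE(\bar{y}^{*}) / MSE(T)$. The short sketch below does this; the variable and function names are hypothetical, and the MSE figures are those printed in Table 1.

```python
# MSE values as printed in Table 1 (figures in parentheses).
mse_values = {
    "T1": 0.3272,
    "T2": 0.2941,
    "T3": 0.2473,
    "T4": 0.2819,
    "T5": 0.2453,
    "T6": 0.2315,
}
mse_reference = 0.3843  # MSE of the Hansen-Hurwitz estimator in Table 1


def relative_efficiency(reference_mse, estimator_mse):
    """Relative efficiency in percent with respect to the reference estimator."""
    return 100.0 * reference_mse / estimator_mse


efficiencies = {name: round(relative_efficiency(mse_reference, mse))
                for name, mse in mse_values.items()}
```

The rounded values agree with the efficiency column of the table.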

Table 2. Expected cost of the estimators for the specified variance $V'_0 = 0.2356$ ($e'_1$ = Rs. 0.90, $e'_2$ = Rs. 0.10, $e_1$ = Rs. 2, $e_2$ = Rs. 5, $e_3$ = Rs. 25).

Estimators       k_opt    n'_opt   n_opt    Expected Cost (in Rs.)
$\bar{y}^{*}$    2.68     -        61       502
$T_1$            2.89     107      40       418
$T_2$            2.03     115      25       332
$T_3$            2.61     88       20       246
$T_4$            1.06     92       16       275
$T_5$            2.68     87       21       244
$T_6$            2.67     69       20       231

From Table 2, we observe that for the specified variance the study estimator $T_6$ incurs less cost than the estimators $\bar{y}^{*}$, $T_1$, $T_2$, $T_3$, $T_4$ and $T_5$.
Conclusion

The use of information on an additional auxiliary character, together with the optimum values involved, increases the efficiency of the study estimator in comparison to the corresponding estimators, both in the case of the fixed cost $C \le C_0$ and in the case of the specified precision $V = V_0$.